Designing with Versal AI Engine 3: Kernel Programming and Optimization

This course, Versal® AI Engine 3, covers the advanced features of the Versal ACAP AI Engine, including debugging an application in the Vitis unified software platform, using filter intrinsics, implementing a system design in hardware, and optimizing an AI Engine kernel program.

The emphasis of this course is on:

  • Describing the advanced features of the Versal ACAP AI Engine architecture
  • Debugging applications using the Vitis unified software platform
  • Using AI Engine filter intrinsics and programming a FIR filter with them
  • Implementing system integration in hardware
  • Optimizing AI Engine kernels using compiler directives, programming style, and efficient movement of data



2-Day Instructor-led Course | Price USD | Training Credits
Hosted Online - $299/day | $598 | 6
In-Person Registration - $399/day | $798 | 8
Printed Course Book (A PDF book is included in the course fee) | $100 | 1
Private Training | Contact Us | Contact Us
Follow on Coaching | Contact Us | Contact Us

Scheduled Classes

Live Online Training (9am-5pm ET)
View our Full Calendar for class date status (Confirmed, Closed, Full).

Training Duration:

2 Days


Who should attend:

Software and hardware developers, system architects, and anyone who needs to accelerate their software applications using Xilinx devices.

Skills Gained

After completing this comprehensive training, you will know how to:

  • Debug an application using the simulation debugging methodology and event traces
  • Identify and debug the various problems that arise in application development
  • Utilize various AI Engine kernel optimization techniques, such as compiler directives, software pipelining, coding for performance, and core utilization
  • Apply C coding guidelines for performance improvement, including function inlining, pointer restricting, and code shuffling
  • Implement a symmetric FIR AI Engine kernel using the mul4_sym and mac4_sym intrinsics
  • Implement a non-symmetric FIR AI Engine kernel using the mul4_nc and mac4_nc intrinsics
  • Develop and generate an AI Engine kernel using Model Composer
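
To give a feel for why the symmetric FIR topic gets its own intrinsics, here is a minimal sketch in plain, portable C++. It does not use the AI Engine API or the mul4_sym/mac4_sym intrinsics; the function names fir_direct and fir_symmetric are hypothetical and chosen only for illustration. The sketch assumes 16-bit samples and coefficients with 32-bit accumulation, and shows that when the coefficients are symmetric, the two samples sharing a coefficient can be added first, which roughly halves the number of multiplies per output.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Direct-form FIR: one multiply per tap per output sample.
// y[n] = sum over k of h[k] * x[n + k]
std::vector<int32_t> fir_direct(const std::vector<int16_t>& x,
                                const std::vector<int16_t>& h) {
    const std::size_t taps = h.size();
    if (taps == 0 || x.size() < taps) return {};
    std::vector<int32_t> y(x.size() - taps + 1, 0);
    for (std::size_t n = 0; n < y.size(); ++n) {
        int32_t acc = 0;
        for (std::size_t k = 0; k < taps; ++k)
            acc += int32_t(h[k]) * int32_t(x[n + k]);
        y[n] = acc;
    }
    return y;
}

// Symmetric FIR: requires h[k] == h[taps - 1 - k].
// The two samples that share a coefficient are pre-added,
// so each coefficient pair costs one multiply instead of two.
std::vector<int32_t> fir_symmetric(const std::vector<int16_t>& x,
                                   const std::vector<int16_t>& h) {
    const std::size_t taps = h.size();
    if (taps == 0 || x.size() < taps) return {};
    const std::size_t half = taps / 2;
    for (std::size_t k = 0; k < half; ++k)
        assert(h[k] == h[taps - 1 - k]);  // precondition: symmetric taps
    std::vector<int32_t> y(x.size() - taps + 1, 0);
    for (std::size_t n = 0; n < y.size(); ++n) {
        int32_t acc = 0;
        for (std::size_t k = 0; k < half; ++k) {
            const int32_t pre_add =
                int32_t(x[n + k]) + int32_t(x[n + taps - 1 - k]);
            acc += int32_t(h[k]) * pre_add;
        }
        if (taps % 2 == 1)  // odd length: the center tap has no partner
            acc += int32_t(h[half]) * int32_t(x[n + half]);
        y[n] = acc;
    }
    return y;
}
```

For any symmetric coefficient set, both functions should return identical results; on the AI Engine, the symmetric intrinsics covered in the course are what map this pre-add structure onto the vector hardware.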

Course Outline

Day 1 and Day 2
  • Introduction to AI Engine Architecture (Review) {Lecture}
  • Versal AI Engine and Data Movement {Lecture}
  • Debugging AI Engine Applications 1 {Lecture, Lab}
  • Debugging AI Engine Applications 2 (Use Cases) {Lecture}
  • Overview of AI Engine Kernel Optimization {Lecture}
  • AI Engine Kernel Optimization Compiler Directives {Lecture}
  • AI Engine Kernel Optimization Coding Style {Lecture}
  • AI Engine Kernel Optimization {Lab}
  • Advanced C++ Kernel Programming {Lecture, Labs}
  • Data Types – Review {Lecture}
  • AI Engine Symmetric Filter Implementation {Lecture, Lab}
  • AI Engine Non-Symmetric Filter Implementation {Lecture}
  • Floating-point Operations {Lecture}
  • Model Composer for AI Engine Development {Lecture}

Please note: The instructor may change the content order to provide a better learning experience.


Prerequisites:

  • Comfort with the C/C++ programming language
  • Software development flow
  • Vitis software for application acceleration development flow
  • Designing with Versal AI Engine 1 and 2


Updated 7-14-2023
©2023 Advanced Micro Devices, Inc. Xilinx, Inc. is now part of AMD. Xilinx, the Xilinx logo, AMD, the AMD Arrow logo, Alveo, Artix, Kintex, Kria, Spartan, Versal, Vitis, Virtex, Vivado, Zynq, and other designated brands included herein are trademarks of Advanced Micro Devices, Inc.