Designing with Versal AI Engine 3: Kernel Programming and Optimization

This course, Versal AI Engine 3, covers the advanced features of the Versal ACAP AI Engine, including debugging an application in the Vitis unified software platform, using filter intrinsics, implementing a system design in hardware, and optimizing an AI Engine kernel program.

The emphasis of this course is on describing the advanced features of the Versal ACAP AI Engine architecture, debugging applications in the Vitis unified software platform, programming a FIR filter with AI Engine filter intrinsics, implementing system integration in hardware, and optimizing AI Engine kernels through compiler directives, programming style, and efficient data movement.

Skills Gained

After completing this comprehensive training, you will know how to:

  • Debug an application using the simulation debugging methodology and event traces
  • Identify and debug the various problems that arise in application development
  • Utilize various AI Engine kernel optimization techniques, such as compiler directives, software pipelining, coding for performance, and core utilization
  • Apply C coding guidelines for performance improvement, including function inlining, restrict-qualified pointers, and code shuffling
  • Implement an AI Engine kernel for a symmetric FIR using the mul4_sym and mac4_sym intrinsics
  • Implement an AI Engine kernel for a non-symmetric FIR using mul4_nc and mac4_nc
  • Develop and generate an AI Engine kernel using Model Composer
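
As a rough illustration of the symmetric FIR and coding-guideline items above, the sketch below shows the idea in portable C++ rather than with AI Engine intrinsics. It assumes an even number of taps and stores only the first half of the coefficients. The names sym_fir_sample and sym_fir are hypothetical, and __restrict is a compiler extension (accepted by GCC, Clang, and MSVC), not standard C++. It is not the course's lab code and does not use mul4_sym or mac4_sym.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper, not an AI Engine intrinsic: computes one output of a
// symmetric FIR with an even tap count. Because taps[k] == taps[N-1-k], the
// two samples sharing a coefficient are added first, halving the multiplies.
// `inline` and `__restrict` mirror the inlining and pointer guidelines.
static inline int32_t sym_fir_sample(const int16_t* __restrict x,
                                     const int16_t* __restrict half_taps,
                                     std::size_t num_taps) {
    int32_t acc = 0;
    for (std::size_t k = 0; k < num_taps / 2; ++k) {
        const int32_t pre_add =
            int32_t(x[k]) + int32_t(x[num_taps - 1 - k]);
        acc += pre_add * half_taps[k];
    }
    return acc;
}

// Filters `in` and returns in.size() - num_taps + 1 outputs, where
// num_taps is twice the number of stored half coefficients.
std::vector<int32_t> sym_fir(const std::vector<int16_t>& in,
                             const std::vector<int16_t>& half_taps) {
    const std::size_t num_taps = 2 * half_taps.size();
    std::vector<int32_t> out;
    if (in.size() < num_taps) return out;  // not enough samples
    out.reserve(in.size() - num_taps + 1);
    for (std::size_t i = 0; i + num_taps <= in.size(); ++i) {
        out.push_back(
            sym_fir_sample(in.data() + i, half_taps.data(), num_taps));
    }
    return out;
}
```

For example, with half coefficients {1, 2} (full taps 1, 2, 2, 1) and input {1, 2, 3, 4, 5}, sym_fir produces two outputs using two multiplies each instead of four.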

Course Outline

Day 1

  • Introduction to AI Engine Architecture (Review) {Lecture}
  • Versal AI Engine and Data Movement {Lecture}
  • Debugging AI Engine Applications 1 {Lecture, Lab}
  • Debugging AI Engine Applications 2 (Use Cases) {Lecture}
  • Overview of AI Engine Kernel Optimization {Lecture}
  • AI Engine Kernel Optimization – Compiler Directives {Lecture}
  • AI Engine Kernel Optimization – Coding Style {Lecture}
  • AI Engine Kernel Optimization {Lab}

Day 2

  • Advanced C++ Kernel Programming {Lecture, Labs}
  • Data Types – Review {Lecture}
  • AI Engine Symmetric Filter Implementation {Lecture, Lab}
  • AI Engine Non-Symmetric Filter Implementation {Lecture}
  • Floating-point Operations {Lecture}
  • Model Composer for AI Engine Development {Lecture}

Scheduled Classes

Instructor-led Web Based Training
5/18/2021 – 5/19/2021

Instructor-led Web Based Training
6/29/2021 – 6/30/2021

Instructor-led Web Based Training
8/17/2021 – 8/18/2021

Education Investment Options

Standard Registration
18 Training Credits
Advanced Registration
16 Training Credits
Basic Follow-on Coaching
Comprehensive Follow-on Coaching

  • To qualify for the Advanced Registration Price, full payment must be received 21 days prior to the first day of class.
  • Basic follow-on coaching includes 2 hours (max 2 calls)
  • Comprehensive follow-on coaching includes 10 hours (max 5 calls)
  • Follow-on Coaching must be purchased at time of registration.


Versal AI Engine 3


Training Duration:

2 Days

Who should attend:

Software and hardware developers, system architects, and anyone who needs to accelerate their software applications using Xilinx devices

Prerequisites:


  • Comfort with the C/C++ programming language
  • Software development flow
  • Vitis software for application acceleration development flow
  • Designing with Versal AI Engine 1 and 2

Related Courses:

Version: 2021-03-17_0932