Get in Touch

Course Outline

Introduction to Cambricon and MLU Architecture

  • Overview of Cambricon’s AI chip portfolio
  • MLU architecture and instruction pipeline
  • Supported model types and applicable use cases

Installing the Development Toolchain

  • Installing BANGPy and the Neuware SDK
  • Setting up environments for Python and C++
  • Ensuring model compatibility and preprocessing

Model Development with BANGPy

  • Managing tensor structures and shapes
  • Constructing computation graphs
  • Support for custom operations within BANGPy

Deploying with the Neuware Runtime

  • Converting and loading models
  • Controlling execution and inference
  • Best practices for edge and data center deployment

Performance Optimization

  • Memory mapping and layer tuning
  • Tracing and profiling execution
  • Identifying and resolving common bottlenecks

Integrating MLU into Applications

  • Utilizing Neuware APIs for application integration
  • Support for streaming and multi-model scenarios
  • Hybrid CPU-MLU inference setups

End-to-End Project and Use Case

  • Lab: Deploying a vision or NLP model
  • Performing edge inference with BANGPy integration
  • Testing accuracy and throughput

Summary and Next Steps

Requirements

  • Knowledge of machine learning model structures
  • Proficiency in Python and/or C++
  • Familiarity with concepts related to model deployment and acceleration

Target Audience

  • Embedded AI developers
  • ML engineers deploying solutions to edge or data center environments
  • Developers working with Chinese AI infrastructure
 21 Hours

Number of participants


Price per participant

Testimonials (1)

Upcoming Courses

Related Categories