1. Development of end-2-end AI tools, models or software to run quantized neural network models on Qualcomm leading edge hardware with optimal resource
2. Design and develop end-to-end test strategies and frameworks for complex software systems
3. Create and maintain automated test suites using modern testing tools and frameworks
4. Development of debugging/profiling tools and Qualcomm SDK to rapid deployment quantized models on device
5. Participate in code reviews and contribute to improving test coverage
6. Collaborate with cross-functional teams to understand requirements and design test approaches
7. Analyze test results and provide detailed feedback to development team
8. Conduct experiments to reproduce and optimize accuracies and performance of models
9. Related papers reading and summarization and presentation.
岗位要求
1. Currently pursuing or recently completed a degree in Computer Science, Artificial Intelligence, EE, or a related field.
2. Experience in large language model (LLM), vision-language model (VLM), and large vision model (LVM), including diffusers, multi-modality, CNNs, RNN/LSTMs, Transformer, and others.
3. Experience in Model quantization and compression, deployment in smartphones or other edge devices.
4. Strong mathematical skills - good foundations of linear algebra, matrix, differential algebra, statistics.
加分项
1. Have strong skill in C++ and Python programming
2. Experience in machine learning/deep learning algorithms and architectures, including CNNs, RNN/LSTMs, Transformer, LVM, LLM
3. Hands-on experience with ML frameworks, such as TensorFlow, PyTorch and Onnx Runtime
4. Have knowledge in on-device AI models deployment is a plus
5. Have knowledge in llama.cpp or ExecuTorch is a plus
6. Have knowledge in Qualcomm SNPE/QNN SDK is a plus