【[105星]grps_trtllm:比vLLM更高效的OpenAI LLM服务。亮点:1. 纯C++实现,性能大幅提升;2. 支持多模态、AI Agents和分布式多GPU推理;3. 提供Gradio聊天界面,交互更友好】
'Higher performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM service implemented with GPRS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function call, AI agents, distributed multi-GPU inference, multimodal capabilities, and a Gradio chat interface.'
GitHub: github.com/NetEase-Media/grps_trtllm
#高性能LLM# #多模态# #AI Agents# #AI创造营#
'Higher performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM service implemented with GPRS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function call, AI agents, distributed multi-GPU inference, multimodal capabilities, and a Gradio chat interface.'
GitHub: github.com/NetEase-Media/grps_trtllm
#高性能LLM# #多模态# #AI Agents# #AI创造营#