专栏名称: 爱可可-爱生活
知名互联网资讯博主 北邮PRIS模式识别实验室陈老师
目录
相关文章推荐
爱可可-爱生活  ·  归一化Transformer (nGPT) ... ·  4 天前  
人工智能那点事  ·  保安持棍棒打死流浪狗,单位通报解聘!网友吵翻了 ·  5 天前  
宝玉xp  ·  #开源项目推荐# Ant Design ... ·  1 周前  
51好读  ›  专栏  ›  爱可可-爱生活

几篇论文实现代码:《Paralinguistics-Aware -20241130201336

爱可可-爱生活  · 微博  · AI  · 2024-11-30 20:13

正文

2024-11-30 20:13

几篇论文实现代码:
《Paralinguistics-Aware Speech-Empowered LLMs for Natural Conversation》(NeurIPS 2024) GitHub: github.com/naver-ai/usdm [fig1]
《Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging》(NeurIPS 2024) GitHub: github.com/LZY-the-boys/Twin-Merging [fig5]
《Agent Planning with World Knowledge Model》(NeurIPS 2024) GitHub: github.com/zjunlp/WKM [fig9]
《Video Depth without Video Models》(2024) GitHub: github.com/prs-eth/RollingDepth
《ChatRex: Taming Multimodal LLM for Joint Perception and Understanding》(2024) GitHub: github.com/IDEA-Research/ChatRex [fig2]
《Star Attention: Efficient LLM Inference over Long Sequences》(2024) GitHub: github.com/NVIDIA/Star-Attention [fig3]
《Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient》(2024) GitHub: github.com/czg1225/CoDe [fig4]
《Identity-Preserving Text-to-Video Generation by Frequency Decomposition》(2024) GitHub: github.com/PKU-YuanGroup/ConsisID
《Caltech Aerial RGB-Thermal Dataset in the Wild》(2024) GitHub: github.com/aerorobotics/caltech-aerial-rgbt-dataset
《From Text to Pose to Image: Improving Diffusion Model Control and Quality》(2024) GitHub: github.com/clement-bonnet/text-to-pose
《ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation》(2024) GitHub: github.com/MrNeRF/ALIKED_CPP
《LightGlue: Local Feature Matching at Light Speed》(2024) GitHub: github.com/MrNeRF/Light_Glue_CPP
《PreF3R: Pose-Free Feed-Forward 3D Gaussian Splatting from Variable-length Image Sequence》(2024) GitHub: github.com/ComputationalRobotics/PreF3R
《V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians》(2024) GitHub: github.com/AuthorityWang/VideoGS [fig6]
《SketchAgent: Language-Driven Sequential Sketch Generation》(2024) GitHub: github.com/yael-vinker/SketchAgent [fig7]
《TRACE: Temporal Grounding Video LLM via Casual Event Modeling》(2024) GitHub: github.com/gyxxyg/TRACE [fig8]
《Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo》(2024) GitHub: github.com/Silent-Zebra/twisted-smc-lm
《Event-3DGS: Event-based 3D Reconstruction Using 3D Gaussian Splatting》(2024) GitHub: github.com/lanpokn/Event-3DGS