几篇论文实现代码:
《OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding》(NeurIPS 2024) GitHub: github.com/yanmin-wu/OpenGaussian
《GeoLLM: Extracting Geospatial Knowledge from Large Language Models》(ICLR 2024) GitHub: github.com/rohinmanvi/GeoLLM
《Asynchronous Large Language Model Enhanced Planner for Autonomous Drivingr》(ECCV 2024) GitHub: github.com/memberRE/AsyncDriver
《Dolphins: Multimodal Language Model for Driving》(ECCV 2024) GitHub: github.com/SaFoLab-WISC/Dolphins [fig8]
《Vision-Language Models Can Self-Improve Reasoning via Reflection》(2024) GitHub: github.com/njucckevin/MM-Self-Improve [fig1]
《Qwen2VL-Flux: Unifying Image and Text Guidance for Controllable Image Generation》(2024) GitHub: github.com/erwold/qwen2vl-flux [fig2]
《EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation》(2024) GitHub: github.com/NVlabs/EdgeRunner
《Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents》(2024) GitHub: github.com/OSU-NLP-Group/WebDreamer [fig3]
《Timer: Generative Pre-trained Transformers Are Large Time Series Models》(2024) GitHub: github.com/thuml/OpenLTM
《DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation》(2024) GitHub: github.com/wz0919/DreamRunner [fig4]
《LRAE: Large-Region-Aware Safe and Fast Autonomous Exploration of Ground Robots for Uneven Terrains》(2024) GitHub: github.com/NKU-MobFly-Robotics/LRAE
《Cautious Optimizers: Improving Training with One Line of Code》(2024) GitHub: github.com/kyleliang919/C-Optim
《SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE》(2024) GitHub: github.com/cyw-3d/SAR3D [fig5]
《All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages》(2024) GitHub: github.com/mbzuai-oryx/ALM-Bench [fig6]
《Efficient Long Video Tokenization via Coordinate-based Patch Reconstruction》(2024) GitHub: github.com/huiwon-jang/CoordTok [fig7]
《Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation》(2024) GitHub: github.com/SuleBai/SC-CLIP
《Advancing GPU IPC for stiff affine-deformable simulation》(2024) GitHub: github.com/spiriMirror/libuipc
《Scaling Concept With Text-Guided Diffusion Models》(2024) GitHub: github.com/WikiChao/ScalingConcept
《DHP-Mapping: A Dense Panoptic Mapping System with Hierarchical World Representation and Label Optimization Techniques》(2024) GitHub: github.com/hutslib/DHP-Mapping
《OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding》(NeurIPS 2024) GitHub: github.com/yanmin-wu/OpenGaussian
《GeoLLM: Extracting Geospatial Knowledge from Large Language Models》(ICLR 2024) GitHub: github.com/rohinmanvi/GeoLLM
《Asynchronous Large Language Model Enhanced Planner for Autonomous Drivingr》(ECCV 2024) GitHub: github.com/memberRE/AsyncDriver
《Dolphins: Multimodal Language Model for Driving》(ECCV 2024) GitHub: github.com/SaFoLab-WISC/Dolphins [fig8]
《Vision-Language Models Can Self-Improve Reasoning via Reflection》(2024) GitHub: github.com/njucckevin/MM-Self-Improve [fig1]
《Qwen2VL-Flux: Unifying Image and Text Guidance for Controllable Image Generation》(2024) GitHub: github.com/erwold/qwen2vl-flux [fig2]
《EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation》(2024) GitHub: github.com/NVlabs/EdgeRunner
《Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents》(2024) GitHub: github.com/OSU-NLP-Group/WebDreamer [fig3]
《Timer: Generative Pre-trained Transformers Are Large Time Series Models》(2024) GitHub: github.com/thuml/OpenLTM
《DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation》(2024) GitHub: github.com/wz0919/DreamRunner [fig4]
《LRAE: Large-Region-Aware Safe and Fast Autonomous Exploration of Ground Robots for Uneven Terrains》(2024) GitHub: github.com/NKU-MobFly-Robotics/LRAE
《Cautious Optimizers: Improving Training with One Line of Code》(2024) GitHub: github.com/kyleliang919/C-Optim
《SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE》(2024) GitHub: github.com/cyw-3d/SAR3D [fig5]
《All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages》(2024) GitHub: github.com/mbzuai-oryx/ALM-Bench [fig6]
《Efficient Long Video Tokenization via Coordinate-based Patch Reconstruction》(2024) GitHub: github.com/huiwon-jang/CoordTok [fig7]
《Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation》(2024) GitHub: github.com/SuleBai/SC-CLIP
《Advancing GPU IPC for stiff affine-deformable simulation》(2024) GitHub: github.com/spiriMirror/libuipc
《Scaling Concept With Text-Guided Diffusion Models》(2024) GitHub: github.com/WikiChao/ScalingConcept
《DHP-Mapping: A Dense Panoptic Mapping System with Hierarchical World Representation and Label Optimization Techniques》(2024) GitHub: github.com/hutslib/DHP-Mapping