几篇论文实现代码:
《Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models》(NeurIPS 2024) GitHub: github.com/MengyuanChen21/NeurIPS2024-CSP
《Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation》(NeurIPS 2024) GitHub: github.com/XiaRho/MADM [fig3]
《Learning Visual Parkour from Generated Images》(CoRL 2024) GitHub: github.com/lucidsim/lucidsim
《Generative Expressive Conversational Speech Synthesis》(2024) GitHub: github.com/walker-hyf/GPT-Talker
《ThinK: Thinner Key Cache by Query-Driven Pruning》(2024) GitHub: github.com/SalesforceAIResearch/ThinK
《Sparsh: Self-supervised touch representations for vision-based tactile sensing》(2024) GitHub: github.com/facebookresearch/sparsh [fig1]
《InstantIR: Blind Image Restoration with Instant Generative Reference》(2024) GitHub: github.com/JY-Joy/InstantIR [fig2]
《Representation Alignment for Generation: Training Diffusion Transformers is Easier Than You Think》(2024) GitHub: github.com/cloneofsimo/repa-rf [fig4]
《FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"》(2024) GitHub: github.com/SalesforceAIResearch/FaithEval
《Turtle: Learning Truncated Causal History Model for Video Restoration》(NeurIPS 2024) GitHub: github.com/Ascend-Research/Turtle
《Skeleton-Driven Inbetweening of Bitmap Character Drawings》(2024) GitHub: github.com/kbrodt/inbetweening
《Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval》(2024) GitHub: github.com/sher222/LeReT
《Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models》(NeurIPS 2024) GitHub: github.com/MengyuanChen21/NeurIPS2024-CSP
《Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation》(NeurIPS 2024) GitHub: github.com/XiaRho/MADM [fig3]
《Learning Visual Parkour from Generated Images》(CoRL 2024) GitHub: github.com/lucidsim/lucidsim
《Generative Expressive Conversational Speech Synthesis》(2024) GitHub: github.com/walker-hyf/GPT-Talker
《ThinK: Thinner Key Cache by Query-Driven Pruning》(2024) GitHub: github.com/SalesforceAIResearch/ThinK
《Sparsh: Self-supervised touch representations for vision-based tactile sensing》(2024) GitHub: github.com/facebookresearch/sparsh [fig1]
《InstantIR: Blind Image Restoration with Instant Generative Reference》(2024) GitHub: github.com/JY-Joy/InstantIR [fig2]
《Representation Alignment for Generation: Training Diffusion Transformers is Easier Than You Think》(2024) GitHub: github.com/cloneofsimo/repa-rf [fig4]
《FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"》(2024) GitHub: github.com/SalesforceAIResearch/FaithEval
《Turtle: Learning Truncated Causal History Model for Video Restoration》(NeurIPS 2024) GitHub: github.com/Ascend-Research/Turtle
《Skeleton-Driven Inbetweening of Bitmap Character Drawings》(2024) GitHub: github.com/kbrodt/inbetweening
《Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval》(2024) GitHub: github.com/sher222/LeReT