专栏名称: 自动驾驶之心

自动驾驶开发者社区，关注计算机视觉、多维感知融合、部署落地、定位规控、领域方案等，坚持为领域输出最前沿的技术方向！

目录

相关文章推荐

一条 · 黄宇兴、Mr.Doodle原作，聚焦现当代艺 ... · 3 天前

一条 · 古沉金丝楠木茶道具，重现宋式茶席风雅，贺岁蛇年吉祥 · 4 天前

一条 · 众筹丨柴烧裸烧手作乌金泥系列，落灰与火痕交融 ... · 5 天前

一条 · 蛇年贺岁款首饰，灵蛇、红花国风元素，传递新年祝福 · 4 天前

一条 · 一条创始人徐沪生亲授：打造有专业、价值、审美 ... · 3 天前

51好读 › 专栏 › 自动驾驶之心

盘一盘世界模型与自动驾驶场景生成

自动驾驶之心 · 公众号 · · 2025-01-12 00:00

正文

作者 | 冰锐编辑 | 自动驾驶之心

原文链接：https://zhuanlan.zhihu.com/p/686277501

点击下方卡片，关注“ 自动驾驶之心 ”公众号

戳我-> 领取 自动驾驶近15个 方向学习路线

>> 点击进入→ 自动驾驶之心 『世界模型』 技术交流群

本文只做学术分享，如有侵权，联系删文

框架

mmagic： https://github.com/open-mmlab/mmagic

综述

World Models for Autonomous Driving: An Initial Survey

The JEPA model aims to construct mapping relationships between different inputs in the encoding space by minimizing input information and prediction errors.

Towards Knowledge-driven Autonomous Driving

Embodied AI is a facet of intelligence emphasizing the direct interaction between an intelligent system and its environment, involving perception, understanding, and action.

扩散模型

Diffusion model

扩散模型 - Diffusion Model【李宏毅2023】_哔哩哔哩_bilibili

Diffusion扩散模型大白话讲解，看完还不懂？不可能！

多模态预训练CLIP

Diffusion论文：Denoising Diffusion Probabilistic Models

StableDiffusion论文：High-Resolution Image Synthesis with Latent Diffusion Models

Diffusers

stable-diffusion-v1-5权重地址：runwayml/stable-diffusion-v1-5 at main

pytorch手写Diffusion Model：The Annotated Diffusion Model

Stable Diffusion with Diffusers

手写扩散模型-diffuers介绍_哔哩哔哩_bilibili

世界模型

GAIA-1（2023.9.29）

MAGICDRIVE（2024.1.26）

论文地址：MAGICDRIVE: STREET VIEW GENERATION WITHDIVERSE 3D GEOMETRY CONTROL

github：https://github.com/cure-lab/MagicDrive

Drive WM（2023.11.29）

https://github.com/BraveGroup/Drive-WM?tab=readme-ov-file

MUVO（2023.11.23）

多模态生成

DriveDreamer（2023.11.27）

https://github.com/JeffWang987/DriveDreamer

https://drivedreamer.github.io/

DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation（2024.3.11）

多视角视频生成

WorldDreamer

https://world-dreamer.github.io/

OccWorld: Learning a 3D Occupancy World Model for Autonomous Driving

Think2Drive: Efficient Reinforcement Learning by Thinking in Latent World Model for Quasi-Realistic Autonomous Driving (in CARLA-v2)

Driving with LLMs

DRIVEVLM: The Convergence of Autonomous Driving and Large Vision-Language Models（2024.2.25）

DriveVLM集成了场景描述、场景分析和分层规划的思维链（CoT）模块

VLM计算量大，提出DriveVLM Dual，将DriveVLM的优势与传统自动驾驶流水线协同混合

据说可以部署在orin芯片上，但猜测只是在orin上跑通，还没有大规模部署到车上

DriveVLM识别了一个倒塌的树木，并给出了稍微向右偏移的驾驶决策

Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving

论文地址：Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving

github：https://github.com/wayveai/Driving-with-LLMs

ADriver-I: A General World Model for Autonomous Driving

请到「今天看啥」查看全文

推荐文章

一条 · 黄宇兴、Mr.Doodle原作，聚焦现当代艺术拍卖，在线热拍

3 天前

一条 · 古沉金丝楠木茶道具，重现宋式茶席风雅，贺岁蛇年吉祥

4 天前

一条 · 众筹丨柴烧裸烧手作乌金泥系列，落灰与火痕交融，珍贵乌金土制成

5 天前

一条 · 蛇年贺岁款首饰，灵蛇、红花国风元素，传递新年祝福

4 天前

一条 · 一条创始人徐沪生亲授：打造有专业、价值、审美的创始人IP

3 天前

创业咖 · 合伙做生意长久的原则，不管做哪行都如此

7 年前

中国经济网 · 涨姿势！退休金≠养老金，原来它不需缴费就可享受

7 年前

上海闵行 · 【资讯】位于闵行华漕的大圆通寺开工重建，北师大上海佛学文化研究中心将坐落于此

7 年前

金融先生MrFinance · 金融先生 | 第六届全国交易大赛

7 年前

晚安少年 · VOL.706 “你很好”“呵呵”

7 年前

Sov5搜索 · 小百科 · 今天看啥 · 移动版

51好读 - 好文章就要读起来!