Domain: GenAI (105)
video-gen (51)
- its-been-a-busy-week-for-world-models-and-their-apis โ Itโs been a busy week for World Models and their APIs! https://t.co/NDHLgQUV6Oโฆ
- new-nvidia-paper-we-introduce-motive-a-motion-centric โ New NVIDIA Paper We introduce Motive, a motion-centric, gradient-based data atโฆ
- physrvg-physics-aware-unified-reinforcement-learning-for โ PhysRVG Physics-Aware Unified Reinforcement Learning for Video Generative Modelโฆ
- what-if-an-ai-could-generate-a-5-minute-video-from-a-single โ what-if-an-ai-could-generate-a-5-minute-video-from-a-single
- world-models-dont-fail-because-they-cant-predict-the-future โ world-models-dont-fail-because-they-cant-predict-the-future
- stream-diffvsr-real-time-diffusion-based-video-super โ stream-diffvsr-real-time-diffusion-based-video-super
- stream-diffvsr-a-low-latency-streamable-video-super โ stream-diffvsr-a-low-latency-streamable-video-super
- tedla-et-al-generating-the-past-present-and-future-from-a โ tedla-et-al-generating-the-past-present-and-future-from-a
- egox-generate-immersive-first-person-video-from-any-third-pe โ egox-generate-immersive-first-person-video-from-any-third-pe
- zeng-et-al-neuralremaster-phase-preserving-diffusion-for-str โ zeng-et-al-neuralremaster-phase-preserving-diffusion-for-str
- bullettime-decoupled-control-of-time-and-camera-pose-for-vid โ bullettime-decoupled-control-of-time-and-camera-pose-for-vid
- introducing-relic-world-model-real-time-interactive-video-ge โ introducing-relic-world-model-real-time-interactive-video-ge
- video4spatial-towards-visuospatial-intelligence-with-context โ video4spatial-towards-visuospatial-intelligence-with-context
- excited-to-share-our-neurips2025-paper-physctrl-generative-p โ excited-to-share-our-neurips2025-paper-physctrl-generative-p
- meet-gaia-3-wayves-most-advanced-generative-world-model-yet- โ meet-gaia-3-wayves-most-advanced-generative-world-model-yet-
- pan-a-world-model-for-general-interactable-and-long-horizon- โ pan-a-world-model-for-general-interactable-and-long-horizon-
- nano-banana-makeugc-veo3-ai-content-factory-this-agent-pumps โ nano-banana-makeugc-veo3-ai-content-factory-this-agent-pumps
- motionstream-real-time-interactive-video-generation-with-mou โ motionstream-real-time-interactive-video-generation-with-mou
- generative-view-stitching-we-worked-with-to-tame-the-short-s โ generative-view-stitching-we-worked-with-to-tame-the-short-s
- zero-shot-video-reasoning-chain-of-frames-isnt-just-for โ zero-shot-video-reasoning-chain-of-frames-isnt-just-for
- code2video-a-code-centric-paradigm-for-educational-video โ code2video-a-code-centric-paradigm-for-educational-video
- want-to-build-๐๐-๐ ๐จ๐ฎ๐ง๐๐๐ญ๐ข๐จ๐ง-๐๐จ๐๐๐ฅ๐ฌ โ want-to-build-๐๐-๐ ๐จ๐ฎ๐ง๐๐๐ญ๐ข๐จ๐ง-๐๐จ๐๐๐ฅ๐ฌ
- 4dnex-feed-forward-4d-generative-modeling-made-easy โ 4dnex-feed-forward-4d-generative-modeling-made-easy
- 1957439501644562640 โ CARTOON HERO 1.0 - AI Animation System
- 1957182570309013714 โ Tencent Hunyuan Open Source Alternative to Genie 3
- i-asked-veo-3-to-create-video-game-scenes-to-famous-places-results-are-wild-10 โ i-asked-veo-3-to-create-video-game-scenes-to-famous-places-results-are-wild-10
- epona-autoregressive-diffusion-world-model-for-autonomous-driving โ epona-autoregressive-diffusion-world-model-for-autonomous-driving
- immersegen-agent-guided-immersive-world-generation-with-alpha-textured-proxies-h โ immersegen-agent-guided-immersive-world-generation-with-alpha-textured-proxies-h
- introducing-unirelight-a-general-purpose-relighting-framework-powered-by-video โ introducing-unirelight-a-general-purpose-relighting-framework-powered-by-video
- -excited-to-introduce-simworld-an-embodied-simulator-for-infinite-photorealistic โ -excited-to-introduce-simworld-an-embodied-simulator-for-infinite-photorealistic
- the-context-size-of-video-world-models-is-only-a-few-frames-like-a-human-with-se โ the-context-size-of-video-world-models-is-only-a-few-frames-like-a-human-with-se
- preprint-of-today-sun-et-al-unigeo-taming-video-diffusion-for-unified-consistent โ preprint-of-today-sun-et-al-unigeo-taming-video-diffusion-for-unified-consistent
- preprint-of-today-wang-et-al-ati-any-trajectory-instruction-for-controllable-vid โ preprint-of-today-wang-et-al-ati-any-trajectory-instruction-for-controllable-vid
- frame-in-n-out-unbounded-controllable-image-to-video-generation-httpstcopoiu2jr6 โ frame-in-n-out-unbounded-controllable-image-to-video-generation-httpstcopoiu2jr6
- vid2world-crafting-video-diffusion-models-to-interactive-world-models-httpstcoad โ vid2world-crafting-video-diffusion-models-to-interactive-world-models-httpstcoad
- nvidia-just-dropped-gen3c-3d-informed-world-consistent-video-generation-with-pre โ nvidia-just-dropped-gen3c-3d-informed-world-consistent-video-generation-with-pre
- excited-to-introduce-gen3c-cvpr2025-a-generative-video-model-with-an-explicit-3d โ excited-to-introduce-gen3c-cvpr2025-a-generative-video-model-with-an-explicit-3d
- wow-the-new-skyreels-video-model-allows-for-really-precise-editing-via-flowedit โ wow-the-new-skyreels-video-model-allows-for-really-precise-editing-via-flowedit
- introducing-diffusionrenderer-a-neural-rendering-engine-powered-by-video โ introducing-diffusionrenderer-a-neural-rendering-engine-powered-by-video
- dreamdrive-generative-4d-scene-modeling-from-street-view-images-pointscoder-boyi โ dreamdrive-generative-4d-scene-modeling-from-street-view-images-pointscoder-boyi
- big-odysseyml-news-were-unveiling-explorer-a-generative-world-model-explorer-tra โ big-odysseyml-news-were-unveiling-explorer-a-generative-world-model-explorer-tra
- 3dtrajmaster-mastering-3d-trajectory-for-multi-entity-motion-in-video-generation โ 3dtrajmaster-mastering-3d-trajectory-for-multi-entity-motion-in-video-generation
- after-reconx-further-explore-the-potential-of-video-diffusion-and-propose-dimens โ after-reconx-further-explore-the-potential-of-video-diffusion-and-propose-dimens
- google-presents-vidpanos-generative-panoramic-videos-from-casual-panning-videos โ google-presents-vidpanos-generative-panoramic-videos-from-casual-panning-videos
- diffusion-models-continue-to-get-stronger-reconx-reconstruct-any-scene-from โ diffusion-models-continue-to-get-stronger-reconx-reconstruct-any-scene-from
- animate3d-animating-any-3d-model-with-multi-view-video-diffusion-recent โ animate3d-animating-any-3d-model-with-multi-view-video-diffusion-recent
- can-generative-models-synthesize-policy-networks-for-agents-like-text-to-image-g โ can-generative-models-synthesize-policy-networks-for-agents-like-text-to-image-g
- can-generative-models-synthesize-policy-networks-for-agents-like-text-to-image โ can-generative-models-synthesize-policy-networks-for-agents-like-text-to-image
- heading-to-in-please-dm-me-if-youd-like-to-have-a-chat-espec โ heading-to-in-please-dm-me-if-youd-like-to-have-a-chat-espec
- fine-grained-controllable-video-generation-via-object-appearance-and-context-pap โ fine-grained-controllable-video-generation-via-object-appearance-and-context-pap
- physics-aware-unified-reinforcement-learning-for-video-generative-models โ Physics-Aware Unified Reinforcement Learning for Video Generative Models
diffusion (14)
- just-found-the-dllm-library-to-create-diffusion-language-mod โ just-found-the-dllm-library-to-create-diffusion-language-mod
- added-confidence-aware-parallel-decoding-to-my-tiny-text-dif โ added-confidence-aware-parallel-decoding-to-my-tiny-text-dif
- tired-to-go-back-to-the-original-papers-again-and-again-our โ tired-to-go-back-to-the-original-papers-again-and-again-our
- ominicontrol-has-been-presented-as-a-highlight-at-iccv-2025 โ ominicontrol-has-been-presented-as-a-highlight-at-iccv-2025
- 1955901155726516652 โ Google Nano-Banana Image Model Results
- flow-matching-fm-is-one-of-the-hottest-ideas-in-generative-ai---and-its โ flow-matching-fm-is-one-of-the-hottest-ideas-in-generative-ai---and-its
- diffusion-แแ ฉแแ ฆแฏแแ ณแซ-aaa-แแ ฆแแ ตแท-แแ ขแแ กแฏแแ ด-แแ ขแ แ ฉแแ ฎแซ-แแ ฅแซแแ ฎแแ กแแ ตแธแแ ตแแ ก---diffusion-แแ ตแแ กแซ-แแ ขแผแแ ฅแผแแ งแผ-ai-แแ ฉแแ ฆแฏแแ ต-แแ กแแ ฆแแ ข-aaa-แแ ฆ โ diffusion-แแ ฉแแ ฆแฏแแ ณแซ-aaa-แแ ฆแแ ตแท-แแ ขแแ กแฏแแ ด-แแ ขแ แ ฉแแ ฎแซ-แแ ฅแซแแ ฎแแ กแแ ตแธแแ ตแแ ก---diffusion-แแ ตแแ กแซ-แแ ขแผแแ ฅแผแแ งแผ-ai-แแ ฉแแ ฆแฏแแ ต-แแ กแแ ฆแแ ข-aaa-แแ ฆ
- normally-changing-robot-policy-behavior-means-changing-its-weights-or-relying โ normally-changing-robot-policy-behavior-means-changing-its-weights-or-relying
- want-to-generate-realistic-handobject-manipulations-for-unseen-objects-check-out โ want-to-generate-realistic-handobject-manipulations-for-unseen-objects-check-out
- google-presents-lightlab-controlling-light-sources-in โ google-presents-lightlab-controlling-light-sources-in
- how-much-data-do-you-need-to-train-an-ai-model-to-generate-realistic-photos โ how-much-data-do-you-need-to-train-an-ai-model-to-generate-realistic-photos
- cosmos-is-a-developer-first-platform-designed-to-help-physical-ai-builders-accel โ cosmos-is-a-developer-first-platform-designed-to-help-physical-ai-builders-accel
- this-is-a-single-uncut-video-showing-a-robot-learning-several-tasks-instantly-af โ this-is-a-single-uncut-video-showing-a-robot-learning-several-tasks-instantly-af
- deep-learning-for-computer-vision-dl4cv-learn-about-modern-methods-for-computer โ deep-learning-for-computer-vision-dl4cv-learn-about-modern-methods-for-computer
image-gen (13)
- ubisoft-la-forge-open-sourced-its-pbr-material-model-chord-t โ ubisoft-la-forge-open-sourced-its-pbr-material-model-chord-t
- 1960464912758493410 โ Nano Banana + Runway Act 2 for Creative Flexibility
- -what-if-gaud-had-midjourney-studiotimfu-s-living-sketches-explore-architecture โ -what-if-gaud-had-midjourney-studiotimfu-s-living-sketches-explore-architecture
- chinas-tencent-just-dropped-hunyuancustom-this-ai-turns-any โ chinas-tencent-just-dropped-hunyuancustom-this-ai-turns-any
- this-object-removal-workflow-kinda-slaps-thoughts-link-to โ this-object-removal-workflow-kinda-slaps-thoughts-link-to
- a-deep-mathematical-exploration-of-diffusion-models-for-modern-ai-image-generati โ a-deep-mathematical-exploration-of-diffusion-models-for-modern-ai-image-generati
- nvidia-presents-add-it-training-free-object-insertion-in-images-with-pretrained โ nvidia-presents-add-it-training-free-object-insertion-in-images-with-pretrained
- generating-images-with-4-bit-flux-schnell-on-my-m1-max-laptop-is-pretty-awesome-1 โ generating-images-with-4-bit-flux-schnell-on-my-m1-max-laptop-is-pretty-awesome-1
- generating-images-with-4-bit-flux-schnell-on-my-m1-max-laptop-is-pretty-awesome โ generating-images-with-4-bit-flux-schnell-on-my-m1-max-laptop-is-pretty-awesome
- mlx-flux-with-mlxdiffusionkit-amazingly-it-works-on-mac-mini16gb-even-though โ mlx-flux-with-mlxdiffusionkit-amazingly-it-works-on-mac-mini16gb-even-though
- mlx-flux-with-mlxdiffusionkit-amazingly-it-works-on-mac-mini16gb-even-though-the โ mlx-flux-with-mlxdiffusionkit-amazingly-it-works-on-mac-mini16gb-even-though-the
- physical-light-real-time-diffusion-exploration-a-thread-1n โ physical-light-real-time-diffusion-exploration-a-thread-1n
- excited-to-share-our-work-on-neural-assets-a-new-method-for-enabling-3d-asset โ excited-to-share-our-work-on-neural-assets-a-new-method-for-enabling-3d-asset
text-to-X (10)
- unsloth-has-a-great-guide-on-fine-tuning-llms-selecting-the- โ unsloth-has-a-great-guide-on-fine-tuning-llms-selecting-the-
- แแ ช-แแ ตแแ ฅ-แแ ฅแผแแ กแฏ-แแ ฉแแแ ณแซแแ ฆ-แแ กแซ-แแ ณแ แ ขแแ ฉ-rag-แแ ฎแแ ฎแจแแ กแฏ-แแ ขแแ กแแ ก-แแ ซ-แแ ฎแฎแแ ต-แแ ฆแจแแ ฅแ แ ฉ-แแ ฉแฏแแ กแแ กแแ ณแซ-llmแแ ฆแแ ฆ-แแ ฅแซแแ ฆแจ โ แแ ช-แแ ตแแ ฅ-แแ ฅแผแแ กแฏ-แแ ฉแแแ ณแซแแ ฆ-แแ กแซ-แแ ณแ แ ขแแ ฉ-rag-แแ ฎแแ ฎแจแแ กแฏ-แแ ขแแ กแแ ก-แแ ซ-แแ ฎแฎแแ ต-แแ ฆแจแแ ฅแ แ ฉ-แแ ฉแฏแแ กแแ กแแ ณแซ-llmแแ ฆแแ ฆ-แแ ฅแซแแ ฆแจ
- openai-devday-ํค๋ ธํธ-์ง๊ทนํ-๊ฐ์ธ์ ์ธ-์์ฝ-1-apps-sdk-gpts-action-์๋ค๊ฐ โ openai-devday-ํค๋ ธํธ-์ง๊ทนํ-๊ฐ์ธ์ ์ธ-์์ฝ-1-apps-sdk-gpts-action-์๋ค๊ฐ
- this-is-the-answer-to-how-humans-and-machines-collaborate-in-the-generative-ai-e โ this-is-the-answer-to-how-humans-and-machines-collaborate-in-the-generative-ai-e
- -we โ -we
- announcing-our-13m-funding-round-to-build-the-next-generation-of-ai โ announcing-our-13m-funding-round-to-build-the-next-generation-of-ai
- for-a-collection-of-advanced-retrieval-augmented-generation-rag-techniques-this-1 โ for-a-collection-of-advanced-retrieval-augmented-generation-rag-techniques-this-1
- clip-is-the-default-choice-for-most-multimodal-llm-research-but-we-know-clip-is-1 โ clip-is-the-default-choice-for-most-multimodal-llm-research-but-we-know-clip-is-1
- multimodal-llms-are-just-superb-to-play-with-hacking-around-with-qwen2-vl-and-it โ multimodal-llms-are-just-superb-to-play-with-hacking-around-with-qwen2-vl-and-it
- pre-releasing-dynamic-rendering-in-gradio-the-idea-is-that-n โ pre-releasing-dynamic-rendering-in-gradio-the-idea-is-that-n
3D-gen (6)
- tinker-diffusions-gift-to-3d-multi-view-consistent-editing-from-sparse-input โ tinker-diffusions-gift-to-3d-multi-view-consistent-editing-from-sparse-input
- monocular-dynamic-reconstruction-is-hard-single-video-input-gt-a-lot-of-3d โ monocular-dynamic-reconstruction-is-hard-single-video-input-gt-a-lot-of-3d
- interested-in-3d-vision-3d-scene-understanding-and-3d-generation-we-have-open-ph โ interested-in-3d-vision-3d-scene-understanding-and-3d-generation-we-have-open-ph
- cvpr-2025-paper-alert-paper-title-magicarticulate-make-your-3d-models-articulati โ cvpr-2025-paper-alert-paper-title-magicarticulate-make-your-3d-models-articulati
- real2code-translating-real-world-articulated-objects-to-sim โ real2code-translating-real-world-articulated-objects-to-sim
- using-gradio-to-generate-3d-objects-for-your-vision-pro โ using-gradio-to-generate-3d-objects-for-your-vision-pro
visionos (5)
- how-can-we-create-interactive-physical-digital-twins-from-videos โ how-can-we-create-interactive-physical-digital-twins-from-videos
- want-to-generate-bimanual-hand-interactions-given-an-articulated-object โ want-to-generate-bimanual-hand-interactions-given-an-articulated-object
- 13-zerocomp-is-being-presented-as-an-oral-today-at-wacv2025 โ 13-zerocomp-is-being-presented-as-an-oral-today-at-wacv2025
- step-by-step-diffusion-an-elementary-tutorial โ step-by-step-diffusion-an-elementary-tutorial
- ๐๐ฎ๐๐ฎ-๐ฃ๐ถ๐ฝ๐ฒ๐น๐ถ๐ป๐ฒ๐-๐ถ๐ป-๐ ๐ฎ๐ฐ๐ต๐ถ๐ป๐ฒ-๐๐ฒ๐ฎ๐ฟ๐ป๐ถ๐ป๐ด-๐ฆ๐๐๐๐ฒ๐บ๐-can-become-complex-and-for-a-good โ ๐๐ฎ๐๐ฎ-๐ฃ๐ถ๐ฝ๐ฒ๐น๐ถ๐ป๐ฒ๐-๐ถ๐ป-๐ ๐ฎ๐ฐ๐ต๐ถ๐ป๐ฒ-๐๐ฒ๐ฎ๐ฟ๐ป๐ถ๐ป๐ด-๐ฆ๐๐๐๐ฒ๐บ๐-can-become-complex-and-for-a-good
3d-gen (3)
- 1965495039217504464 โ AI-Powered Automatic PBR Texturing for Any Mesh
- 1963580239541363098 โ 3D Foundation Models Hiring - Munich & London
- 1959608339337281586 โ Future of Architecture with AI - Image to 3D
editing (1)
- 1948878604928254257 โ Aleph Instantaneous Inpainting - Remove Reflections
web-graphics (1)
- blenderfusion-3d-grounded-visual-editing-and-generative-compositing โ blenderfusion-3d-grounded-visual-editing-and-generative-compositing
reconstruction (1)
- can-generative-video-models-help-pose-estimation-yes-we-find-that-generative-vid โ can-generative-video-models-help-pose-estimation-yes-we-find-that-generative-vid