All
Search
Images
Videos
Shorts
Maps
News
More
Shopping
Flights
Travel
Notebook
Report an inappropriate content
Please select one of the options below.
Not Relevant
Offensive
Adult
Child Sexual Abuse
Length
All
Short (less than 5 minutes)
Medium (5-20 minutes)
Long (more than 20 minutes)
Date
All
Past 24 hours
Past week
Past month
Past year
Resolution
All
Lower than 360p
360p or higher
480p or higher
720p or higher
1080p or higher
Source
All
Dailymotion
Vimeo
Metacafe
Hulu
VEVO
Myspace
MTV
CBS
Fox
CNN
MSN
Price
All
Free
Paid
Clear filters
SafeSearch:
Moderate
Strict
Moderate (default)
Off
Filter
Direct Preference Optimization (DPO) explained
100 views
Dec 27, 2024
substack.com
論文紹介:Direct Preference Optimization: Your Language Mod
…
Aug 19, 2024
speakerdeck.com
How to fine-tune GPT-4o with DPO on Azure OpenAI | Pradip Tivhale
…
11 months ago
linkedin.com
Direct Nash Optimization: Teaching language models to self-improve
…
Sep 3, 2024
Microsoft
0:14
How to Choose the Right Glass Frames | Eyebuydirect Recommen
…
2M views
Apr 19, 2024
TikTok
eyebuydirect
1:01
This AI Avatar Listens & Reacts Live | Avatar Forcing
768 views
1 month ago
YouTube
AIQUEST Shorts
7:52
21. Direct Preference Optimization (DPO) (Rafailov et al., 2023)
14 views
3 months ago
YouTube
LOADING_
5:27
How AI Models Are Tuned to Follow Instructions : RLHF vs DPO
13 views
2 months ago
YouTube
AI Strategy & Trends
0:14
Aligning LLMs with Human Preferences
3 views
2 weeks ago
YouTube
The AI Opus
1:06
Avatar Forcing: Real-Time Interactive AI Avatars
1 month ago
YouTube
ML Insights
25:57
Beyond the Button: How AI Learns From Your Feedback
1 week ago
YouTube
My Weird Prompts
0:12
What DPO Really Is (and What It Assumes) #ml #ai #coding #data #
…
62 views
3 weeks ago
YouTube
MLSimplified
7:38
Qwen-Image: Advances in Text Rendering and Precise Image Editi
…
20 views
2 months ago
YouTube
AI Paper Review
6:41
Day 4 : Master Generative AI Training: LoRA, RLHF, and Fine-T
…
107 views
2 months ago
YouTube
Cloud and Coffee with Navnit
3:28
Enhancing Song Generation in LLMs using DPO-based Multi-Pref
…
5 views
2 months ago
YouTube
Quang Phạm Việt
4:08
Avatar Forcing: Real-Time Interactive Head Avatars
1 month ago
YouTube
AI Research Roundup
2:28
Five ML Concepts - #2
159 views
4 weeks ago
YouTube
Software Wrighter
6:36
How Artificial Intelligence Reasons - Unlocking the Blackbox of LLM m
…
18 views
2 months ago
YouTube
Thinking Tower
4:35
Avatar Forcing: Real-Time Interactive Head Avatar Generatio
…
1 views
1 month ago
YouTube
CosmoX
1:52
Rajiv Shah on Instagram: "Think you know how Reinforcement Learnin
…
4.7K views
4 months ago
Instagram
rajistics
31:25
DPO的缺陷及其变体 ORPO KTO SimPO DPOP IPO LD-DPO
3.5K views
3 weeks ago
bilibili
东川路第一可爱猫猫虫
19:19
【DPO】直接偏好优化 详细原理推导 快速上手实战
3.1K views
3 weeks ago
bilibili
东川路第一可爱猫猫虫
Federated Fine-Tuning of Large Language Models: Kahneman-Tve
…
9 months ago
acm.org
Introduction to Discrete Choice Models
11.4K views
Feb 2, 2018
YouTube
Introduction to choice models
3:37
A.14 Revealed preference | Consumption - Microeconomics
84K views
Oct 19, 2014
YouTube
Policonomics
9:50
Optimization with Calculus 1
762.3K views
Jun 16, 2008
YouTube
Khan Academy
4:41
A 12-year-old app developer | Thomas Suarez | TED
11.8M views
Oct 24, 2012
YouTube
TED
59:36
Mod-01 Lec-30 Unconstarined optimization techniques : Direct s
…
73.7K views
Jun 11, 2014
YouTube
nptelhrd
58:34
The Year of Pluto - New Horizons Documentary Brings Humanity Cl
…
21.1M views
Jun 12, 2015
YouTube
NASA
20:54
MDPO: Multi-Granularity Direct Preference Optimization for Mathe
…
8 months ago
Vimeo
Conference Catalysts, LLC
See more videos
More like this
Feedback