How to fine-tune LLMs with Tunix


Original URL: https://www.youtube.com/watch?v=8essLqkBsX8

Unlock the full potential of your large language models with Tunix, an innovative open-source JAX-based library for post-training. This video explains the two-stage LLM training process, focusing on how Tunix excels in the post-training phase to instill strong reasoning capabilities. See a practical example of using Tunix with reinforcement learning to improve math problem-solving, leveraging its efficiency on accelerators like Google TPUs. Improve your LLM performance with this powerful tool.
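The reward in this kind of RLVR (reinforcement learning with verifiable rewards) setup is typically a simple programmatic check rather than a learned reward model: GSM8K reference answers end with "#### <number>", so a completion can be scored by whether its final number matches. Below is a minimal sketch of such a reward under that assumption; the function name and parsing details are illustrative, not Tunix's actual API:

```python
import re

def gsm8k_reward(completion: str, reference: str) -> float:
    """Verifiable reward: 1.0 if the model's final number matches the
    GSM8K reference answer (the value after '####'), else 0.0.

    Illustrative only — the Tunix GRPO example linked below may parse
    answers differently.
    """
    def last_number(text: str):
        # Strip thousands separators, then grab the last int/decimal.
        nums = re.findall(r"-?\d+(?:\.\d+)?", text.replace(",", ""))
        return nums[-1] if nums else None

    ref = last_number(reference.split("####")[-1])
    pred = last_number(completion)
    return 1.0 if pred is not None and pred == ref else 0.0
```

Because the reward is exact-match and automatic, it scales to large sampled batches on TPUs without any human labeling in the loop.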

Resources:
GitHub for Tunix → https://goo.gle/4854A9X
Tunix GRPO example → https://goo.gle/46M9UwF
Additional examples → https://goo.gle/4nCfIjE
DeepSeekMath (GRPO) paper → https://goo.gle/3IA5ukt

Chapters:
0:00 - Introduction to Tunix
0:17 - Understanding LLM training stages
0:35 - Tunix: A JAX-based LLM post-training library
0:50 - Exploring Tunix's capabilities and supported models
1:05 - Reinforcement learning for LLMs overview
1:25 - RLVR for math reasoning demo (GSM8K dataset)
1:50 - Setting up and training with GRPO
2:05 - Tunix performance results and benefits
2:20 - Getting involved with Tunix
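For the GRPO training step in the chapter list, the core idea (per the DeepSeekMath paper linked under Resources) is to score each sampled completion relative to the other completions drawn for the same prompt, which removes the need for a separate learned value model. Here is a minimal JAX sketch of that group-relative advantage computation; names and shapes are illustrative assumptions, not Tunix's actual API:

```python
import jax.numpy as jnp

def grpo_advantages(rewards: jnp.ndarray, eps: float = 1e-6) -> jnp.ndarray:
    """Group-relative advantages as in DeepSeekMath's GRPO.

    rewards: [num_prompts, group_size] — one scalar reward per sampled
    completion, with group_size completions drawn for each prompt.
    Each reward is normalized by its own group's mean and std, so a
    completion competes against siblings from the same prompt.
    """
    mean = rewards.mean(axis=-1, keepdims=True)
    std = rewards.std(axis=-1, keepdims=True)
    return (rewards - mean) / (std + eps)

# Example: 2 prompts x 4 samples of the 0/1 verifiable reward above.
# Correct completions get positive advantages, incorrect ones negative:
# grpo_advantages(jnp.array([[1., 0., 0., 1.],
#                            [0., 0., 0., 1.]]))
```

In a full GRPO trainer these advantages then weight a clipped, PPO-style policy-gradient loss; see the Tunix GRPO example above for the library's end-to-end version.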

Subscribe to Google for Developers → https://goo.gle/developers

Speaker: Wei Wei
Products Mentioned: Google AI

