Ilia Larchenko's picture

Ilia Larchenko PRO

IliaLarchenko

AI & ML interests

I am a Data Science Director with a diverse technical and business background. I live in Bangkok and work in Agoda, where I lead multiple DS and ML teams. I was an active Kaggler in TOP-20 of a global competition ranking, Competitions and Notebooks Master

Recent Activity

posted an update about 10 hours ago
I placed ๐Ÿฅˆ 2nd in the LeHome Challenge (ICRA 2026), and ๐Ÿฅ‡ 1st of 62 teams in the first simulation round. Now I'm open-sourcing the full solution โ€” code, tech report, and final weights. The task: teach a cheap two-armed robot (SO-ARM101) to fold 4 garment types โ€” long/short tops and pants. Garment category is hidden at eval. Round 1 in sim (auto-scored), round 2 on a real robot (jury-scored). I trained a VLA policy with an RL loop on top. The key ideas: ๐Ÿง  The policy is its own value function. From the same forward pass that picks the next action chunk, cheap heads predict success probability, task completion %, garment type, and future keypoint distances + a Q-residual. Those become the advantage signal for RL โ€” no separate critic. ๐Ÿ” A fully asynchronous RL loop coordinated only through the HF Hub: 1 trainer (H200) ships a fresh checkpoint ~every 40 min while N rollout workers (and a human doing teleop / DAgger corrections) collect data in parallel. Nobody waits โ€” it uses the off-policy nature of the loop to the fullest. ๐Ÿ“ˆ Binary success is too sparse, so I densify it into per-frame advantage via GAE โ€” from objective keypoint checkpoints, the success-probability value baseline, and completion %. ๐ŸŽ›๏ธ The RL combines AWR + RECAP. I also tune the inference knobs โ€” execution length, playback speed, inpainting overlap, CFG scale, best-of-N โ€” with a per-parameter Thompson-sampling bandit folded into rollout collection. ๐Ÿ”ง Round 2: with only ~1 week and no access to the eval robot โ€” so the pipeline was sim โ†’ my robot โ†’ their robot, leaning on heavy augmentation to make the policy more robust. ๐Ÿ“ Blog: https://ilialarchenko.com/projects/lehome2026 ๐Ÿ“„ Tech report: https://huggingface.co/papers/2606.27163 ๐Ÿ”ง Code: https://github.com/IliaLarchenko/lehome_solution ๐Ÿค— Sim policy: https://huggingface.co/IliaLarchenko/lehome_sim ๐Ÿค— Real policy: https://huggingface.co/IliaLarchenko/lehome_real ๐ŸŒ Challenge: https://lehome-challenge.com/
updated a model about 11 hours ago
IliaLarchenko/lehome_sim
updated a model about 11 hours ago
IliaLarchenko/lehome_real
View all activity

Organizations

ZeroGPU Explorers's profile picture LeRobot Worldwide Hackathon's profile picture Hugging Face Context Course's profile picture Robotics Team's profile picture