<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Policy Learning | Henry Kou</title><link>https://kenryhou2.github.io/tags/policy-learning/</link><atom:link href="https://kenryhou2.github.io/tags/policy-learning/index.xml" rel="self" type="application/rss+xml"/><description>Policy Learning</description><generator>HugoBlox Kit (https://hugoblox.com)</generator><language>en-us</language><lastBuildDate>Mon, 29 Jun 2026 00:00:00 +0000</lastBuildDate><image><url>https://kenryhou2.github.io/media/icon_hu_da05098ef60dc2e7.png</url><title>Policy Learning</title><link>https://kenryhou2.github.io/tags/policy-learning/</link></image><item><title>DeformX</title><link>https://kenryhou2.github.io/projects/deformx/</link><pubDate>Mon, 29 Jun 2026 00:00:00 +0000</pubDate><guid>https://kenryhou2.github.io/projects/deformx/</guid><description>&lt;p&gt;DeformX is a co-simulation framework for deformable linear objects such as wires, cables, and ropes. The project addresses a common gap in robotic manipulation simulation: visually realistic DLO assets are often procedural and physically shallow, while physics-oriented approaches can simplify slender elastic objects into rigid-link chains or generic soft bodies that miss bending, twisting, and shear behavior.&lt;/p&gt;
&lt;p&gt;The framework integrates a dedicated Cosserat rod physics engine with NVIDIA Isaac Sim. The rod engine simulates DLO dynamics, self-collisions, and contact with arbitrary free-form meshes, while mesh skinning maps discrete rod deformation onto imported CAD models for high-fidelity visualization. This combination is intended to support both realistic visual rendering and principled physics in robot-learning pipelines.&lt;/p&gt;
&lt;p&gt;The paper demonstrates DeformX across synthetic data generation and policy learning for DLO manipulation, then validates visual and physical fidelity against real-world experiments. Fine-tuning Segment Anything Model 3 on DeformX-generated data improves real-image wire segmentation by 10.2% mAP@75, and a rope-swinging policy trained entirely in DeformX reaches a mean target-hitting error of 6.6 cm on a UR5e manipulator in real-world trials.&lt;/p&gt;
&lt;video controls muted playsinline preload="metadata" width="100%"&gt;
&lt;source src="hit_apple_trials.mp4" type="video/mp4"&gt;
Your browser does not support the video tag. &lt;a href="hit_apple_trials.mp4"&gt;Download the rope-swinging trials video&lt;/a&gt;.
&lt;/video&gt;
&lt;p&gt;&lt;em&gt;Rope-swinging policy trained in DeformX executing target-hitting trials on a UR5e manipulator.&lt;/em&gt;&lt;/p&gt;
&lt;p&gt;See the
and the
. The paper was accepted to
.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Keywords:&lt;/strong&gt; DeformX, deformable linear objects, DLO simulation, Cosserat rods, NVIDIA Isaac Sim, mesh skinning, robot manipulation, synthetic data generation, policy learning, sim-to-real transfer, SAM3, UR5e.&lt;/p&gt;</description></item></channel></rss>