ML

I have been and am doing work at the Robot Intelligence through Perception Lab at TTIC.

My projects:

Active Advantage-Aligned Online Reinforcement Learning with Offline Data [link]
- We propose a priority-based data sampling policy that improves on the uniform sampling of RLPD, by incorporating the onlineness of the the transitions and their estimated advantages.
Some more are in the oven!