- Designed and implemented transformer-based feature extraction with Pytorch and designed sequence matching at the back-end of LCD.
- Trained on Places365 with knowledge distillation, achieving top-1 of 53.28% and top-5 of 84.04%, comparable to CNN-based models.
- Improved the average precision by 3.18% compared to state-of-the-art CNN-based methods on NewCollege and CityCentre datasets.
- Authored and published a paper in 2022 IEEE International Conference on Advanced Robotics and Mechatronics.
Light Monocular Visual Odometry through Attentive Tensor-compressed LSTM Model for Robotic VSlam (09/2020-12/2020)
- Designed a CNN+T-LSTM model with attention mechanism to estimate the 6-DoF absolute-scale pose from the optimal flow feature for the monocular visual Odometry, making it friendly to be applied on edge devices.
- Achieved 1/7 the size of DeepVO and 23x faster than Flowdometry on KITTI dataset, while deployed it to a Raspberry Pi-based robot.
- Authored and published a paper in 2021 WRC Symposium on Advanced Robotics and Automation, awarded the Best Student Paper.