I am a third-year Ph.D. student at the University of Rochester, advised by Prof. Jiebo Luo, and a quantitative researcher at JQ Investments. Previously, I worked as an algorithm researcher at Megvii from late 2017 to December 2020, leading the R&D of scene text comprehension algorithms in Face++/FaceID. My research interests lie in computer vision and deep learning. I received my B.S. degree in Software Engineering from Beihang University in 2016. My detailed CV is available here.
Cloud2Sketch: Augmenting Clouds with Imaginary Sketches
Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion
SparseDet: Towards End-to-End 3D Object Detection
Jianhong Han, Zhaoyi Wan, Zhe Liu, Jie Feng, and Bingfeng Zhou.
17th International Conference on Computer Vision Theory and Applications. (Best Student Paper Award)
Facial Attribute Transformers for Precise and Robust Makeup Transfer
Zhaoyi Wan, Haoran Chen, Wentao Jiang, Cong Yao, and Jiebo Luo.
IEEE Winter Conference on Applications of Computer Vision (WACV).
Slender Object Detection: Diagnoses and Improvements
On Vocabulary Reliance in Scene Text Recognition
Zhaoyi Wan, Jielei Zhang, Liang Zhang, Cong Yao, and Jiebo Luo.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, June 2020.
Real-time Scene Text Detection with Differentiable Binarization
TextScanner: Reading Characters in Order for Robust Scene Text Recognition
Zhaoyi Wan, Minghang He, Haoran Chen, Xiang Bai, and Cong Yao.
The 34th AAAI Conference on Artificial Intelligence (AAAI), New York, NY, February 2020.
Scene Text Recognition from Two-Dimensional Perspective
Minghui Liao, Jian Zhang, Zhaoyi Wan, Fengming Xie, Jiajun Liang, Cong Yao, and Xiang Bai.
The 33rd AAAI Conference on Artificial Intelligence (AAAI), Honolulu, HI, February 2019. (Oral Presentation)