Currently I am the director of research, AI at Salesforce. Before that I was a senior researcher at MetaMind. And I worked as a Postdoctoral Researcher Scholar at the University of California, Los Angeles (UCLA) from Jun 2014 to Sep 2015. I got my Ph.D. in the department of Computer Science and Engineering, University at Buffalo, SUNY in 2014 under the supervision of Prof. Jason J. Corso. And I got my B.S. and M.S. of Computer Science degree from Huazhong University of Science and Technology(HUST) in the year 2005 and 2007 in China..

Research Interests

Deep Learning, Video Parsing, Image Classification, Interactive Robot Learning, Natural Language Processing, Dialogue Learning, Metric Learning, Large Scale Retrieval.

Institutions

         

Links

Curriculum Vitae
Google Scholar

Contact

cxiong [at] salesforce.com

Publications

2017

Non-Autoregressive Neural Machine Translation, Jiatao Gu, James Bradbury, Caiming Xiong, Victor O.K. Li, Richard Socher.
[ pdf, blog post, dataset, Press: CNBC, Venturebeat, Slator ]

DCN+: Mixed Objective and Deep Residual Coattention for Question Answering, Caiming Xiong, Victor Zhong and Richard Socher.
[ pdf]

Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning, Victor Zhong, Caiming Xiong, and Richard Socher.
[ pdf, blog post, dataset, Press: TechCrunch, Venturebeat ]

Learned in Translation: Contextualized Word Vectors, Bryan McCann, James Bradbury, Caiming Xiong, Richard Socher.
Advances in Neural Information Processing Systems (NIPS 2017). [ pdf, blog post, code, Press: MIT Tech Review ]

A Deep Reinforced Model for Abstractive Summarization, Romain Paulus, Caiming Xiong, Richard Socher.
[ pdf, blog post, Press: Forbes, MIT Tech Review, TechCrunch]

A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks, Kazuma Hashimoto, Caiming Xiong, Yoshimasa Tsuruoka, Richard Socher.
The 2017 Conference on Empirical Methods on Natural Language Processing (EMNLP 2017). [ pdf, blog post]

Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning, Jiasen Lu*, Caiming Xiong*, Devi Parikh, Richard Socher.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017). (* equal contribution). [ pdf] (Spotlight Presentation)

Dynamic Coattention Networks For Question Answering, Caiming Xiong, Victor Zhong, Richard Socher.
International Conference on Learning Representations (ICLR 2017).[ pdf, blog post]

Quasi-Recurrent Neural Networks, James Bradbury, Stephen Merity, Caiming Xiong, Richard Socher.
International Conference on Learning Representations (ICLR 2017).[ pdf, blog post]

Pointer Sentinel Mixture Models, Stephen Merity, Caiming Xiong, James Bradbury, Richard Socher.
International Conference on Learning Representations (ICLR 2017).[ pdf, new dataset]

2016

Dynamic Memory Networks for Visual and Textual Question Answering, Caiming Xiong, Stephen Merity, Richard Socher.
The 33rd International Conference on Machine Learning (ICML 2016). [ pdf, New York Times]

Active Clustering with Model-Based Uncertainty Reduction, Caiming Xiong, David M. Johnson, Jason Corso.
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI). [ pdf]

Recognizing Car Fluents from Videos, Bo Li, Tianfu Wu, Caiming Xiong, Song-Chun Zhu.
IEEE Computer Vision and Pattern Recognition (CVPR 2016). [ pdf ] (Oral Presentation)

Grounded Semantic Role Labeling, Shaohua Yang, Qiaozi Gao, Changsong Liu, Caiming Xiong, Joyce Y. Chai, Song-Chun Zhu.
The 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2016). [ pdf ]

Robot Learning with a Spatial, Temporal, and Causal And-Or Graph, Caiming Xiong, Nishant Shukla, Wenlong Xiong, Song-Chun Zhu.
IEEE International Conference on Robotics and Automation (ICRA 2016).[ pdf ]

Maximum Margin Dirichlet Process Mixtures for Clustering, Gang Chen, Haiying Zhang, Caiming Xiong.
AAAI Conference on Artificial Intelligence (AAAI 2016). [ pdf ]

Semi-Supervised Nonlinear Distance Metric Learning via Forests of Max-Margin Cluster Hierarchies, David M. Johnson, Caiming Xiong, Jason Corso.
IEEE Transactions on Knowledge and Data Engineering (TKDE). [ pdf ]

A Way out of the Odyssey: Analyzing and Combining Recent Insights for LSTMs, Shayne Longpre, Sabeek Pradhan, Caiming Xiong, Richard Socher.
[ pdf]

2015

A Unified Framework for Human-Robot Knowledge Transfer, Nishant Shukla, Caiming Xiong, Song-Chun Zhu.
AAAI Fall Symposium on AI for Human-Robot Interaction (AI-HRI 2015).[ pdf ]

Joint Action Recognition and Pose Estimation From Video, Xiaohan Nie, Caiming Xiong, Song-Chun Zhu.
IEEE Computer Vision and Pattern Recognition (CVPR 2015).[ pdf ]

Can humans fly? Action understanding with multiple classes of actors, Chenliang Xu, Shao-Hang Hsieh, Caiming Xiong, Jason Corso.
IEEE Computer Vision and Pattern Recognition (CVPR 2015).[ pdf ]

Jointly modeling deep video and compositional text to bridge vision and language in a unified framework, Ran Xu, Caiming Xiong, Wei Chen, Jason Corso.
AAAI Conference on Artificial Intelligence (AAAI 2015).[ pdf ]

2014

Seeing is worse than believing: Reading people's minds better than computer vision methods recognize actions, Andrei Barbu, Daniel P. Barrett, Wei Chen, N. Siddharth, Caiming Xiong , Jason Corso, Christiane D. Fellbaum, Catherine Hanson, Stephen Jos´e Hanson, S´ebastien H´elie, Evguenia Malaia, Barak A. Pearlmutter, Jeffrey Mark Siskind, Thomas Michael Talavage, Ronnie B. Wilbur.
European Conference on Computer Vision (ECCV 2014).[ pdf ]

Latent Domains Modeling for Visual Domain Adaptation, Caiming Xiong, Scott McCloskey, Shao-Hang Hsieh, Jason Corso.
AAAI Conference on Artificial Intelligence (AAAI 2014).[ pdf ](Oral Presentation)

Actionness Ranking with Lattice Conditional Ordinal Random Fields, Wei Chen, Caiming Xiong, Jason Corso
IEEE Computer Vision and Pattern Recognition (CVPR 2014).[ pdf , code. ]

Adaptive Quantization for Hashing: An Information-Based Approach to Learning Binary Codes, Caiming Xiong, Wei Chen, Gang Chen, David M. Johnson, Jason Corso
SIAM International Conference on Data Mining (SDM 2014).[ pdf, code. ](Oral Presentation)

Spectral Active Clustering of Remote Sensing Images, Zifeng Wang, Gui-Song Xia, Caiming Xiong, Liangpei Zhang.
IEEE International Geoscience and Remote Sensing Symposium (IGARSS 2014).[pdf]

2013

Uncertainty reduction for active image clustering via a hybrid global-local uncertainty model, Caiming Xiong, David M. Johnson, and Jason Corso.
AAAI Conference on Artificial Intelligence (Late-Breaking Papers Track) (AAAI 2013). [ pdf, code. ]

Comprehensive cross-hierarchy cluster agreement evaluation, David M. Johnson, Caiming Xiong, Jason Corso.
AAAI Conference on Artificial Intelligence (Late-Breaking Papers Track) (AAAI 2013). [ pdf, code. ]

2012

Streaming hierarchical video segmentation, Chenliang Xu*, Caiming Xiong*, Jason Corso.
European Conference on Computer Vision (ECCV 2012). [ pdf, code ] (Oral Presentation)(* equal contribution).

Random forests for metric learning with implicit pairwise position dependence, Caiming Xiong, David M. Johnson, R. Xu, Jason Corso.
ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2012). [ pdf, code ](Oral Presentation)

Coaction discovery: Segmentation of common actions across multiple videos, Caiming Xiong, David M. Johnson, R. Xu, Jason Corso.
Multimedia Data Mining Workshop in Conjunction with the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (MDMKDD 2012). [ pdf, code ]

Dictionary transfer for image denoising via domain adaptation, Gang Chen, Caiming Xiong, Jason Corso.
IEEE International Conference on Image Processing (ICIP 2012). [ pdf ](Oral Presentation)

Efficient max-margin metric learning, Caiming Xiong, David M. Johnson, Jason Corso.
European Conference on Data Mining (ECDM 2012). [ pdf, code ](Best Paper Award)

Spectral active clustering via purification of the k-nearest neighbor graph, Caiming Xiong, David M. Johnson, Jason Corso.
European Conference on Data Mining (ECDM 2012). [ pdf, code ](Oral Presentations)

Online Active Constraint Selection For Semi-Supervised Clustering, Caiming Xiong, David M. Johnson, Jason Corso.
ECAI Active and Incremental Workshop, 2012. [ pdf, code ]

2011

AirTouch: Interacting With Computer Systems At A Distance., Daniel R. Schlegel, Albert Y. C. Chen, Caiming Xiong, Jeffery A. Delmerico and Jason Corso.
IEEE Winter Vision Meetings: Workshop on Applications of Computer Vision (WACV 2011). [ pdf](Oral Presentations)

Towards a parts-based approach to sub-cortical brain structure parsing, Digvijay Gagneja, Caiming Xiong, Jason Corso.
SPIE Conference on Medical Imaging, 2011. [ pdf ]

2009

From image parsing to painterly rendering, Kun Zeng, Mingtian Zhao, Caiming Xiong, Song-Chun Zhu.
ACM Transaction on Graphics, 2009 (TOG). [ pdf ]

Marker-less registration based on template tracking for augmented reality, Liang Lin, Yongtian Wang, Yue Liu, Caiming Xiong, Kun Zeng.
Multimedia Tools Applications, 2009 (MTA). [ pdf ]

Professional Services

Conference and Workshop Organization

  • Organizing Committee, Workshop on language and vision (at CVPR 2015).
Journal and Conference Reviewer
  • TPAMI, IJCV, TIP, PR, Neurocomputing, ICCV 2015, ICRA 2015, CVPR 2016, ICRA 2016, AAAI 2016, NIPS 2016, AAAI 2017, CVPR 2017, ICLR 2017, ICCV 2017, NIPS 2017