My Re-se-Arch Interests
My research focuses on computer Vis-Eye-on, often motivated by the task of building explainable and improvable visual Turing tests and robot autonomy through lifelong communicative learning. To accomplish my research goals, I am interested in pursuing a unified framework for machines to ALTER (Ask, Learn, Test, Explain and Refine) recursively in a principled way.
• Deep Perception of the Visible and Deep Understanding of the Dark Jointly
A picture is worth a thousand words. What are the words? They refer to concepts and models, both visible and invisible (including patterns, symbols and logics). What are the structures organizing the words? They refer to image/video and language grammars (hierarchical, compositional, reconfigurable, causal and explainable). In addition, "The more you look, the more you see" (quoted from Prof. Stuart Geman). My research focuses on (i) statistical learning of large-scale and highly expressive hierarchical and compositional models from heterogeneous (big) data including images, videos and text, (ii) statistical inference by learning near-optimal cost-sensitive decision policies, and (iii) statistical theory of performance-guaranteed learning algorithms and optimally scheduled inference procedures, i.e., maximizing the "gain" (accuracy) and minimizing the "pain" (computational costs).
• Lifelong Learning through ALTER Recursively (Demo)
Here, I use online object tracking as an example to articulate what I mean by machines that can ALTER recursively. Tracking is one of the innate capabilities of animals and humans for learning concepts (Susan Carey, The Origin of Concepts. Oxford Univ. Press, 2011).