Research of Song-Chun Zhu

This page is not updated, Link to my lab research page

Projects in my group are divided in five categories.

1, Image Parsing

Inspired by Marr's quest for a general vision solution, we pose image parsing aa computing process that infers the following representations from a single image under a common framework. click the buttons/text to view projects

2, Video Parsing

Video parsing extends image parsing to larger scenes and longer duration to account for the interactions between agents and manipulable objects.

3, Modeling, Learning, Inference Algorithms, and Basic Theories

The progress of computer vision as a scientific discipline should be measured by its development of models, algorithms, and theories, so that its problems and solutions can be understood analytically.

Modeling and Learning	Inference Algorithms	Basic Theories
Minimax Entropy Learning Prior Learning Learning by Information Projection Learning Implicit and Explicit Manifolds Stochastic Image Grammar And-or-graph Learning PAC-learning of And-or graph models Energy landscape of non-convex optimization problems	Region Competition (PDE) Data-Driven Markov Chain Monte Carlo (DDMCMC) Swendsen-Wang Cuts (SWC) Top-down/Bottom-up Inference Top-down/bottom-up information gains and scheduling C4: exploring multiple solutions by cluster sampling α+β+γ+C4 parsing with ambiguity	Information Scaling Theory PAC-learning in vision Order Parameter Theory Analogic learning Communicative learning

4, Real World Applications

Only when commercial requests are clearly presented, the progress shall be measured by specific datasets and benchmarks.
We have produced commericial systems based on projects at our Lotus Hill Institute.

Persistent Surveillance	Look at Humans	Computational Photography and arts	Aerial Image Understanding	Intelligent Vehicle
Background Modeling Tracking in AoG Actions Events and Ontology Image to Text (I2T) PTZ Camera Tracking Counting People	Face Modeling Hair Modeling and Simulation Clothes Modeling Face Aging Simulation Hair Sketching Curve process Human Sketching	Painterly Rendering Painterly Animation Abstract Art Rendering Stroke process rendering Portraiture by Active Templates Papercut for human portrait Cartoon Animation Artistic Face Lighting	Roof detection Hierarchical and contextual Aerial Image Understanding Scene functionality	Scene modeling Top-down/ Bottom-up Inference Driving Log (I2T)

5, Core and Long Standing Issues in Human and Computer Vision

These are long-standing debates which can only be answered numerically using statistical and information theoretical approaches.

Issue I: Discriminative vs. Generative methods
Issue II: Top-down vs. Bottom-up computing processes
Issue III: 2D view-based vs. 3D object-based representations
Issue IV: Texture vs. Texton (similarly Context v.s. Hierarchy, Sketchable v.s. Non-Sketchable, Trackable v.s. Non-Trackable)

Downloads (more downloads from my lab webpage)

Datasets	Codes	Tutorials
Lotus Hill Image Datasets	DDMCMC Image Segmentation Image Segmentation by Generalized SW-cuts Active Basis Primal sketch many other codes are available in project pages.	ICCV'05 Tutorial: Markov Chain Monte Carlo for Computer Vision

This page is not updated, Link to my lab research page

Projects in my group are divided in five categories.

1, Image Parsing

2, Video Parsing

3, Modeling, Learning, Inference Algorithms, and Basic Theories

Modeling and Learning

Inference Algorithms

Basic Theories

4, Real World Applications

Persistent Surveillance

Look at Humans

Computational Photography and arts

Aerial Image Understanding

Intelligent Vehicle

5, Core and Long Standing Issues in Human and Computer Vision

Issue I: Discriminative vs. Generative methods

Issue II: Top-down vs. Bottom-up computing processes

Issue III: 2D view-based vs. 3D object-based representations

Issue IV: Texture vs. Texton

Downloads (more downloads from my lab webpage)