Rice University

Events at Rice


Electrical and Computer Engineering
Digital Signal Processing
Houston Chapter IEEE Circuits and Systems Society
IEEE Signal Processing Society

Speaker: Shiv N. Vitaladevuni
Scientist - Speech, Language and Multimedia Unit
Raytheon BBN Technologies

Efficient Sparse Projection with Application to Scene and Video Classification

Thursday, March 28, 2013
4:00 PM to 4:50 PM

1070 George R. Brown Hall
Rice University
6100 Main St
Houston, Texas, USA

The Speech, Language and Multimedia unit at BBN has wide-ranging machine learning interests across several government-funded research programs, including natural language processing (DARPA DEFT), speech (DARPA BOLT), image/video (IARPA ALADDIN), and bio-medical data (DARPA DCAPS). I will begin the talk with a brief introduction to some of the ongoing projects at BBN and an overview of the IARPA ALADDIN project, which focuses on semantic analysis of web videos.

One of the tasks under ALADDIN is video event detection. Sparse projection has been shown to be highly effective in related domains, e.g., image denoising and scene/object classification. However, practical application to large-scale problems such as video analysis requires efficient versions of sparse projection algorithms such as Orthogonal Matching Pursuit (OMP). In particular, random-projection-based locality-sensitive hashing (LSH) has been proposed for OMP. We present a novel technique called Comparison Hadamard Random Projection (CHRP) that further improves the efficiency of LSH within OMP. CHRP combines two techniques: (1) the Fast Johnson-Lindenstrauss Transform (FJLT), which uses a randomized Hadamard transform and a sparse projection matrix for LSH, and (2) Achlioptas' random projection, which uses only addition and comparison operations. Our approach provides the robustness of FJLT while completely avoiding multiplications, and is demonstrated for image denoising, scene classification, and video categorization.
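To make the "addition and comparison only" idea concrete, here is a minimal Python sketch of an Achlioptas-style random projection used as a sign-based LSH code. It is not the CHRP method from the talk: the Hadamard step of FJLT is left out, and the function names (`achlioptas_signs`, `project_add_only`, `hash_bits`), the dimensions, and the toy vectors are assumptions chosen only for illustration.

```python
import numpy as np

def achlioptas_signs(k, d, rng):
    """Draw a k x d matrix with entries +1, 0, -1 (prob. 1/6, 2/3, 1/6).

    The usual sqrt(3/k) scaling is omitted here (an illustrative choice):
    a positive scale does not change the sign bits computed below.
    """
    values = np.array([1, 0, -1], dtype=np.int8)
    return rng.choice(values, size=(k, d), p=[1 / 6, 2 / 3, 1 / 6])

def project_add_only(signs, x):
    """Project x using only additions and subtractions, no multiplications."""
    out = np.empty(signs.shape[0], dtype=x.dtype)
    for i, row in enumerate(signs):
        # Add the coordinates marked +1, subtract those marked -1, skip zeros.
        out[i] = x[row == 1].sum() - x[row == -1].sum()
    return out

def hash_bits(signs, x):
    """Sign-based LSH code: one comparison against zero per projected value."""
    return project_add_only(signs, x) > 0

rng = np.random.default_rng(0)
signs = achlioptas_signs(k=64, d=256, rng=rng)

x = rng.standard_normal(256)
y = x + 0.05 * rng.standard_normal(256)  # a near neighbor of x
z = rng.standard_normal(256)             # an unrelated vector

# Hamming distance between hash codes: small for near neighbors,
# close to half the bits for unrelated vectors.
near = np.count_nonzero(hash_bits(signs, x) != hash_bits(signs, y))
far = np.count_nonzero(hash_bits(signs, x) != hash_bits(signs, z))
print("differing bits (near, far):", near, far)
```

In an OMP setting, codes like these could be used to shortlist dictionary atoms that are likely to correlate strongly with the current residual, so that exact inner products are computed only for the shortlist; how the talk's method does this in detail is not stated in the abstract.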

Host: Ashok Veeraraghavan

Biography of Shiv N. Vitaladevuni:
Shiv Vitaladevuni is a Scientist at the Speech, Language and Multimedia Unit at Raytheon BBN Technologies. His research interests include semantic analysis of images and videos, machine learning, and optimization. Dr. Vitaladevuni received his Ph.D. from the University of Maryland under Prof. Larry Davis, specializing in video-based action recognition. Prior to joining BBN, he worked for three years at the Howard Hughes Medical Institute (HHMI), building an end-to-end system for reconstructing neuronal connectivity from electron micrographs. Dr. Vitaladevuni is the Co-PI on an IARPA program in which he leads research on discovering theories of financial market behavior from heterogeneous data. He is a contributor to several ongoing programs at BBN, including IARPA ALADDIN for web video analysis, DARPA MADCAT for document image analysis, and DARPA DCAPS for detecting subtle indicators of psychological distress from web text and EEG.
