|
144 Ponce de Leon Ave NE, Apt # 1214,
|
OBJECTIVE
Seeking an internship for Summer
2008.
RESEARCH INTERESTS
I am interested in the areas of Pattern Recognition, Computer Vision and Digital Signal Processing Applications. In particular, I am interested in recognition algorithms for multi-modal content, metadata generation and metadata-assisted information retrieval techniques.
EDUCATION
|
PhD in ECE
Georgia Institute of Technology, Atlanta, USA Advisor: Prof. Monson H. Hayes III Current Grade Point Average: 3.86/4.00 |
Aug 2006 - present
|
|
Master of Science in ECE
Georgia Institute of Technology, Atlanta, USA Current Grade Point Average: 3.80/4.00 |
Aug 2004 - May 2006
|
|
Bachelor of Engineering in Electronics and Communication Engineering
PSG College of Technology, Coimbatore, India Degree Grade Point Average: 3.86/4.00 |
Aug 2000 - May 2004 |
PROFESSIONAL EXPERIENCE
|
School of ECE, Georgia Tech, Atlanta, USA
Research Assistant Working on Handwritten Mathematical Content Recognition with Prof. Monson H. Hayes III. Funded by the |
Jan 2005 - present |
|
Texas Instruments Educational & Productivity Solutions, Dallas, USA
Summer Intern Worked on development of TI-nspire and continued research on Handwritten Mathematical Content Recognition. |
Jun 2006 - Aug 2006 |
|
Samsung Electronics Co. Ltd. India Software Operations, Bangalore, India Summer Intern Trained in the basics of Embedded Systems, Multi Function Peripherals and the ARM processor. |
May 2003 - Jun 2003 |
RESEARCH AND ACADEMIC PROJECTS
|
Audio-Video Based Handwritten Mathematical Content Recognition For Classroom Videos |
Aug 2004 - present |
|
Our goal is to design and implement a handwritten mathematical content recognition system. Initially we worked on tablet PC input and later, during my summer internship at Texas Instruments E&PS, we also looked at other input devices that could be used for easier input of mathematical expressions into the calculator. We are currently using classroom videos as the input and have completed the pre-processing stage which involves the extraction of text written on the whiteboard from the foreground objects such as the instructor and also the background i.e. the whiteboard. Since mathematical content is not expressed using cursive handwriting, the character segmentation stage is quite simple. Our current approach to the character recognition problem involve the use of both audio as well as video to recognize the mathematical content written on the whiteboard. When there is some ambiguity in the recognition of a character written on the whiteboard, we will perform Hidden Markov Model(HMM) based speech recognition over a short window of speech to resolve the ambiguity. We are working on the speech recognizer which will recognize mathematical words i.e. numbers, alphabets, math operators and Greek symbols, from a dictionary of a few hundred words. We are also exploring alternate techniques such as word-spotting to overcome some of the problems in the HMM based speech recognizer due to mismatch in the grammar of the input and that of the recognizer. The bigger challenge would be to formulate a probabilistic technique by which the location of the word in the speech segment and the words before and after it would help in synchronizing the audio and video and therefore enable integration of speech information with the video. Decision making in case of conflicts between the video and speech recognizers would be resolved by using a combination of their confidence measures. If the errors in recognition of the symbols written on the whiteboard are considered to be independent of the errors in speech recognition of the same symbol, the recognition accuracy will be increased by using both the audio and video. This research also involves the analysis of the structure of equations by using logical and spatial relationships between recognized symbols and once again, we intend to use speech information to resolve ambiguity. Such an audio-video based approach would also prove to be useful for training the system for new users. |
|
Implementation of Reconstruction from Projections Using Filtered Backprojections |
Aug 2005 - Dec 2005 |
|
This project involved implementation of the filtered backprojection algorithm which reconstructs an image from its projections. The image was reconstructed using different values of ∆θ, different resolutions and also different filters in the filtering stage. Fourier Reconstruction of the image was also attempted. |
|
Fricative Detection |
Jan 2005 - Apr 2005 |
|
The goal of this project was to implement a fricative detector. In addition to implementing fricative detection, segmentation and classification was also implemented for certain other phonemes. This project could either be a part of a knowledge-based phoneme recognition system or be used as the front-end for a statistical speech recognition system, for example, one that uses HMMs or ANNs. |
|
Implementation of Inte polation Schemes and Point-Based Registration for Medical Images |
Jan 2005 - Apr 2005 |
|
Various interpolation schemes were implemented on a Magnetic Resonance Image (MRI) of the Human Brain. The original image was rotated 10 degrees and interpolation was performed to get the intensity values at original pixel positions. This rotation and interpolation was repeated till a complete 360 degree rotation was achieved and the final interpolated image was compared with the original image. The interpolation schemes implemented included Nearest Neighbor, Bilinear and Spline. Next, point-based image registration was implemented on MRI images of the Human Heart with known correspondences for Identity, Rigid Body, Affine and Thin Plate Spline transformations. |
|
Real-Time Implementation of a Speaker Verification System |
Dec 2003 - May 2004 |
|
Our approach was a combination of stochastic and template modeling to implement a text-dependent system. Discrete HMMs were used to match sequences of vector-quantized mel-cepstral coefficients. We made use of the Bakis 5-state HMM and the Baum-Welch algorithm to generate probability measures. Codebook initialization and training was done using the LBG algorithm and the Lloyd?s algorithm respectively. The system was trained and tested using speech samples recorded in different environments. |
SOFTWARE SKILLS
MATLAB,
C, C++, LISP, LabVIEW, Genie, VHDL, TSPICE, PSPICE, Xilinx, ORCAD, Network
Simulator 2, Wavesurfer 1.8.5.
RELEVANT GRADUATE COURSEWORK
|
Digital Image Processing
Medical Image Processing Speech Signal Processing Pattern Recognition Dicrete Event Dynamic Systems Computer Vision Artificial Intelligence Linear Algebra Mathematical Methods for Machine Learning Mathematical Methods for Applied Sciences I |
PROFESSIONAL ACTIVITIES
& AWARDS
|
-
Master of
Ceremony for the National Conference on Signals, Systems and Security (NCSSS)
2002 and the National Conference on emerging trends in VLSI Design And Testing
(NCVDAT) 2003 held at PSG College of Technology, -
Designed and
implemented a circuit for an Automatic Plant Irrigation System which was
displayed at the Golden Jubilee Exhibition held at PSG College of Technology, -
Attended a
six-week training program in LabVIEW in the National Instruments funded Virtual
Instrumentation Laboratory at PSG College of Technology, -
Ranked among top
0.2% in the Tamil Nadu Professional Courses Entrance Examination 2000 conducted
for admission to engineering colleges. |
MISCELLANEOUS
Visa
Status: F-1 Student Visa
REFERENCES
Available
upon request.