Activity
1M followers
Experience
-
-
-
Co-founder, Advisor
Element Inc
-
Facebook
-
Cofounder and Chief Scientist
MuseAmi
-
Owner
YLC Consulting LLC
-
-
-
AT&T Bell Laboratories
-
research scientist
THALES - Laboratoire Central de Recherche
-
Skills
Publications
-
Feature Learning and Deep Architectures: New Directions for Music Informatics
Journal of Intelligent Information Systems / Springer
-
Moving Beyond Feature Design: Deep Architectures and Automatic Feature Learning in Music Informatics
Proc. of the International Society of Music Information Retrieval (ISMIR) 2012
Projects
-
LAGR
-
LAGR was a DARPA competition in autonomous vision-based off-road robot navigation. Nine labs competed in many races through complex off-road terrain over the course of 3 years. In the first year, our team achieved the competition's 3-year goal of improving the baseline run-time by a factor of 2, forcing the government to up the goal to a factor of 3.
Using visual input for navigation is essential in unstructured outdoor environments, which can cause simpler sensors such as GPS…LAGR was a DARPA competition in autonomous vision-based off-road robot navigation. Nine labs competed in many races through complex off-road terrain over the course of 3 years. In the first year, our team achieved the competition's 3-year goal of improving the baseline run-time by a factor of 2, forcing the government to up the goal to a factor of 3.
Using visual input for navigation is essential in unstructured outdoor environments, which can cause simpler sensors such as GPS, odometry, and laser scanners to drift severely or become unusably noisy. As an alternative to such sensors, I contributed a novel, fast form of visual odometry. Our efficient “hybrid visual odometry” method used 5% of the CPU time, in contrast to full VO methods which require an entire CPU. Despite this, it maintained comparable accuracy to full VO over diverse terrain types. This makes it particularly suitable for consumer robots, extraterrestrial rovers, and other platforms with limited computing power.
This contribution was described in the ICRA 2009 paper Efficient Off-Road Localization Using Visually Corrected Odometry, and contributed to our JFR 2009 paper, A Multi-Range Architecture for Collision-Free Off-Road Robot Navigation.Other creatorsSee project -
DjVu
-
DjVu (pronounced "déjà vu") is a digital document format with advanced compression technology.
I initiated and directed the project.Other creatorsSee project -
Check Recognition with Deep Learning
-
The first practical system for reading the amount on bank checks.
It was deployed by NCR first in ATM machines in 1995, then in high-speed check reading machines in 1996. At some point in the late 90's these systems read an estimated 10 to 20% of all the checks in the US.
It used a convolutional neural net, combined with a so-called graph-transformer network, a form of non conditional random field in which the solution space can be represented by a treillis.
Other creatorsSee project
Other similar profiles
Explore top content on LinkedIn
Find curated posts and insights for relevant topics all in one place.
View top content