Google Operating System Unofficial news and tips about Google

  • Subscribe to our RSS feed.
  • Twitter
  • StumbleUpon
  • Reddit
  • Facebook
  • Digg

Thursday, 13 June 2013

How Google's Image Recognition Works

Posted on 08:16 by Unknown
Just like Google Drive, Google+ Photos uses some amazing image recognition technology to make photos searchable, even if they don't have captions or useful filenames. "This is powered by computer vision and machine learning technology, which uses the visual content of an image to generate searchable tags for photos combined with other sources like text tags and EXIF metadata to enable search across thousands of concepts like a flower, food, car, jet ski, or turtle," explains Google.

Google acquired DNNresearch, a start-up created by Professor Geoffrey Hinton and two of his graduate students at the University of Toronto. They built "a system which used deep learning and convolutional neural networks and easily beat out more traditional approaches in the ImageNet computer vision competition designed to test image understanding." Google built and trained similar large-scale models and found that this approach doubles the average precision, compared to other object recognition methods. "We took cutting edge research straight out of an academic research lab and launched it, in just a little over six months," says Chuck Rosenberg, from the Google Image Search Team.

The paper, titled "ImageNet Classification with Deep Convolutional Neural Networks" [PDF], explains how this works. It uses supervised learning, 7 hidden weight layers and feature extractors learned from the data. "Our neural net has 60 million real-valued parameters and 650,000 neurons. It overfits a lot. Therefore we train on 224x224 patches extracted randomly from 256x256 images, and also their horizontal reflections."


Google says that the publicly available photo search feature recognizes 1100 tags. "We came up with a set of about 2000 visual classes based on the most popular labels on Google+ Photos and which also seemed to have a visual component, that a human could recognize visually. In contrast, the ImageNet competition has 1000 classes. As in ImageNet, the classes were not text strings, but are entities, in our case we use Freebase entities which form the basis of the Knowledge Graph used in Google search. An entity is a way to uniquely identify something in a language-independent way. (...) Since we wanted to provide only high precision labels, we also refined the classes from our initial set of 2000 to the most precise 1100 classes for our launch."

Some other examples of classes that are recognized: car, dance, kiss, meal, hibiscus, dahlia, sunsets, polar bear, grizzly bear. The system recognizes both generic visual concepts and specific objects. "Unlike other systems we experimented with, the errors which we observed often seemed quite reasonable to people. The mistakes were the type that a person might make - confusing things that look similar."
Email ThisBlogThis!Share to XShare to FacebookShare to Pinterest
Posted in Google Photos | No comments
Newer Post Older Post Home

0 comments:

Post a Comment

Subscribe to: Post Comments (Atom)

Popular Posts

  • More People Can Buy Apps from the Android Market
    If there's one thing that Google should do to improve Android, it's developing a better Android Market. Google's app store has a...
  • Could Google Save Yahoo from Microsoft?
    Microsoft is taking over Yahoo! by Gnal. Licensed as Creative Commons Attribution . Even if it's hard to believe that Yahoo will accept...
  • Google's Marketing Dashboard
    MediaPost reports that Google wants to integrate the reporting features from all of its ad products to provide a "fully functional mar...
  • Watch a Video in YouTube's HTML5 Player
    In January, YouTube launched a player that used the HTML5 video tag. To try this player, you have to go to youtube.com/html5 and enable th...
  • Swipe Navigation in the Mobile Gmail Site
    One month ago, Google updated the Gmail app for iOS and added a swipe gesture that lets you move between conversations without having to re...
  • A Bogus DMCA Takedown Request (Part 3)
    I've mentioned in the previous two posts that Inspection 12 sent a DMCA notice for one of my posts, Google took it offline and reject...
  • The Old Image Search, Still Available
    The old Google Image Search interface is still available in the OneBox result that's displayed for some Google searches like [tropical b...
  • Google+ Photo Search With Image Recognition
    Last year, Google Drive added an advanced image search feature powered by Goggles that recognizes objects and uses OCR technology to extrac...
  • Search Engine Comparison Poll: The Results
    Six days ago, I posted a poll that asked you to evaluate the quality of the first results from Google, Yahoo, Windows Live. You had to ente...
  • Bring the Mashups to Google Maps
    Google Maps API was the most successful API ever created by Google and the tool behind a lot of cool mashups available on the web today. Th...

Categories

  • Acquisitions (17)
  • Ads (16)
  • AJAX Search (4)
  • Android (83)
  • Annoyances (7)
  • API (9)
  • April Fools Day (2)
  • Blog Search (4)
  • Blogger (20)
  • Book Search (11)
  • DMCA (4)
  • Easter Egg (18)
  • FeedBurner (4)
  • Firefox extensions (10)
  • Froogle (1)
  • Game (3)
  • gm (1)
  • Gmail (161)
  • Google Analytics (4)
  • Google Apps (17)
  • Google Bookmarks (7)
  • Google Buzz (14)
  • Google Calendar (17)
  • Google Cast (3)
  • Google Checkout (5)
  • Google Chrome (105)
  • Google Chrome OS (28)
  • Google Co-op (9)
  • Google Contacts (9)
  • Google Desktop (5)
  • Google Dictionary (8)
  • Google Docs (80)
  • Google Drive (41)
  • Google Earth (22)
  • Google Gears (5)
  • Google Goggles (7)
  • Google Groups (2)
  • Google Hangouts (4)
  • Google Health (2)
  • Google Instant (15)
  • Google Keep (5)
  • Google Latitude (5)
  • Google Local (9)
  • Google Maps (80)
  • Google Music (3)
  • Google News (20)
  • Google Notebook (9)
  • Google Now (14)
  • Google Pack (2)
  • Google Phone (9)
  • Google Photos (14)
  • Google Play (3)
  • Google Plus (29)
  • Google Profiles (5)
  • Google Promos (2)
  • Google Reader (47)
  • Google Scholar (1)
  • Google Sites (1)
  • Google Suggest (13)
  • Google Takeout (1)
  • Google Talk (19)
  • Google Toolbar (7)
  • Google Translate (38)
  • Google Trends (9)
  • Google TV (4)
  • Google Update (1)
  • Google Video (11)
  • Google Voice (6)
  • Google Wallet (2)
  • Google Wave (3)
  • Greasemonkey (10)
  • iGoogle (32)
  • Image Search (31)
  • InOut (13)
  • Knowledge (14)
  • Mobile (133)
  • Month in review (1)
  • Music (3)
  • Nostalgia (6)
  • OneBox (19)
  • orkut (10)
  • Page Creator (1)
  • Picasa (5)
  • Picasa Web Albums (22)
  • SearchMash (2)
  • Security (10)
  • Social (32)
  • Software (4)
  • Spam (2)
  • Tips (86)
  • Universal Search (3)
  • User interface (116)
  • Visualization (9)
  • Voice Search (14)
  • Web History (7)
  • Web Search (202)
  • Webmasters (5)
  • Windows Live (5)
  • Yahoo (8)
  • Yahoo Pipes (2)
  • YouTube (122)

Blog Archive

  • ▼  2013 (364)
    • ►  September (1)
    • ►  August (60)
    • ►  July (60)
    • ▼  June (56)
      • Full Google Reader Backup
      • Lorem Ipsum Google Translate
      • 2 Years of Google+
      • YouTube's Subscription Reminder
      • Office Document Editor for Chromebooks
      • Google+ Photos Trash
      • Find the Number of Gmail Search Results
      • Google Play Presents: Galaxy S4 and HTC One
      • Google+ Photos App for Chromebooks
      • 1 Billion Google Safe Browsing Users
      • YouTube And a Distributed Denial-of-Service Attack
      • Google Now's Topics Page Is Back
      • Attach Images in Gmail Using Drag and Drop
      • Gay Google (2013)
      • Google's Tilde Operator No Longer Works
      • Finding Synonyms Using Google Search
      • The Feed Reading Playground Is Now Open
      • YouTube's Smarter Video Player
      • YouTube Collections
      • Gmail's Quick Action Button for YouTube Messages
      • Watch Now: a New YouTube Experiment
      • Google Mine
      • A Google Reader Puzzle From 2007
      • The Most Well-Connected Google Service
      • AdSense's Ping Pong Easter Egg
      • From Google Reader to Feedly
      • Print Google Forms
      • New Google+ Notifications
      • Google Now Wallpapers
      • Google Tests a New Mobile Gmail Interface
      • The Odd One Out
      • Ads in Gmail's Promotions Tab
      • Top 10 Google Play Services Reviews
      • Google Promotes Map Maker
      • Knowledge Graph and Google Bombs
      • Google's Unified Storage, Now Available
      • Google No Longer Mentions Data Sources
      • Google Stats
      • Google's Calorie Counter, Not Just for Voice Search
      • Chrome Frame Discontinued
      • How Google's Image Recognition Works
      • Google's Mobile Quick View Missing
      • Google Cloud Print App for Android
      • Google Shows Your Recent Sign-ins
      • Navigation Arrows in Google Image Search
      • New Nexus 7 User
      • New UI for Related Searches in Google Image Search
      • Google's CalDav and CardDav APIs for Everyone
      • The Stock Android 4.2 Keyboard in Google Play
      • Google Currency Conversion Tips
      • Find Your Photos Using Google Search
      • Conversational Voice Search in Chrome for iOS
      • Ode to 3GS
      • The Curious Case of Google Hangouts History
      • Google's Calorie Counter
      • Bigger Thumbnails in Google Search for Tablets
    • ►  May (59)
    • ►  April (48)
    • ►  March (47)
    • ►  February (29)
    • ►  January (4)
  • ►  2012 (134)
    • ►  December (14)
    • ►  November (18)
    • ►  October (26)
    • ►  September (5)
    • ►  August (8)
    • ►  July (17)
    • ►  June (24)
    • ►  May (4)
    • ►  April (18)
  • ►  2011 (13)
    • ►  January (13)
  • ►  2010 (487)
    • ►  December (47)
    • ►  November (37)
    • ►  October (44)
    • ►  September (44)
    • ►  August (55)
    • ►  July (44)
    • ►  June (43)
    • ►  May (54)
    • ►  April (48)
    • ►  March (40)
    • ►  February (28)
    • ►  January (3)
  • ►  2008 (65)
    • ►  February (13)
    • ►  January (52)
  • ►  2007 (435)
    • ►  December (60)
    • ►  November (55)
    • ►  October (57)
    • ►  September (64)
    • ►  August (59)
    • ►  July (70)
    • ►  June (59)
    • ►  May (11)
Powered by Blogger.

About Me

Unknown
View my complete profile