Adobe helps search engines to index Flash-based content

By Tim Conneally | Published July 1, 2008, 4:00 PM

Adobe Systems Inc. announced today that it is working with both Google and Yahoo to improve the search engine indexing of Flash (.SWF) files -- a capability search engines have had for years, but haven't used.

Search engine giants Google and Yahoo are using Adobe's recently updated Flash Player standard to help make Flash-based content searchable. Google has already launched its indexing mechanism, and Yahoo is reportedly next in line to do the same.

The SWF specification is in its ninth iteration and has been available for review for some time, but until this version many did not make full use of it because of licensing fees. Now, as part of the Open Screen Project, SWF is more openly available, which allows Google to officially roll out a capability it has had for years.

When the spec was opened in May 2008, Rob Savoye, head of the Gnash (GNU Flash Player) development team, said, "Adobe's licensing had acted as a bottleneck, as you were allowed to read the specifications and able to build using SWF but prohibited from building software for SWF file playback. Or, as [Dave] McAllister [Adobe's Director of Standards and Open Source] put it, you: 'Couldn't build anything that looked or smelled like a Flash player - only Adobe could do it.' As of May 1, though, you can build your own Flash player and embed Flash into an application."

Graphical, audio, and video content, such as that found in the FLV files common to YouTube, is not yet searchable. Although Flash gadgets, buttons, menus, and self-contained Web sites can now be found through Google, the mechanism is limited to textual content and embedded URLs. Flash files without anchor text will remain invisible to searches.

However, Web developers need not make any special modifications to existing content for it to be indexed, since Google will index it automatically. As a result, any textual material that site managers do not want Google to index should be either removed or replaced with a graphical representation.

Creating a robots.txt file may also work for developers wanting to exclude Flash content from being indexed, since other methods involve marking up HTML and may not be applicable to some Flash-based sites' design.
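As a rough illustration of the robots.txt approach, the sketch below uses Python's standard `urllib.robotparser` module to check whether a crawler would be allowed to fetch a Flash file. The `/flash/` directory, the `example.com` URLs, and the `is_crawlable` helper are hypothetical, chosen only for this example; the rules Google actually honors for SWF content may differ.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: keep all crawlers out of a /flash/ directory
# where the site is assumed to store its SWF files.
ROBOTS_TXT = """\
User-agent: *
Disallow: /flash/
"""


def is_crawlable(robots_txt: str, url: str, agent: str = "Googlebot") -> bool:
    """Return True if `agent` may fetch `url` under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for url in ("http://example.com/flash/intro.swf",
                "http://example.com/index.html"):
        print(url, "->", "allowed" if is_crawlable(ROBOTS_TXT, url) else "blocked")
```

Grouping SWF files under a single directory keeps the rule simple; sites that mix Flash and HTML files in the same paths would need per-file rules instead.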

Google's Webmaster Central Blog lists three main limitations which the team is currently working on resolving:

  1. Google's bots can be thwarted by some JavaScript, so Flash files executed via JavaScript are unlikely to be indexed.
  2. Flash files and their linked external content (HTML, XML, or other SWF files) are indexed separately, so Flash files are effectively bereft of content attached from external resources.
  3. Hebrew, Arabic, and other "bi-directional" languages (where numeric content reads left-to-right, but text reads right-to-left) are not yet supported, though this is a common limitation.
