Thursday, July 3, 2008

Adobe Advances Rich Media Search on the Web

Adobe Flash Technology Enhances Search Results for Dynamic Content and Rich Internet Applications

Adobe Systems Incorporated (Nasdaq:ADBE) today announced the company is teaming up with search industry leaders to dramatically improve search results of dynamic Web content and rich Internet applications (RIAs). Adobe is providing optimized Adobe® Flash® Player technology to Google and Yahoo! to enhance search engine indexing of the Flash file format (SWF) and uncover information that is currently undiscoverable by search engines. This will provide more relevant automatic search rankings of the millions of RIAs and other dynamic content that run in Adobe Flash Player. Moving forward, RIA developers and rich Web content producers won’t need to amend existing and future content to make it searchable — they can now be confident it can be found by users around the globe.

The openly published SWF specification describes the file format used to deliver rich applications and interactive content via Adobe Flash Player, which is installed on more than 98 percent of Internet-connected computers. Although search engines already index static text and links within SWF files, RIAs and dynamic Web content have been generally difficult to fully expose to search engines because of their changing states — a problem also inherent in other RIA technologies.

“Until now it has been extremely challenging to search the millions of RIAs and dynamic content on the Web, so we are leading the charge in improving search of content that runs in Adobe Flash Player,” said David Wadhwani, general manager and vice president of the Platform Business Unit at Adobe. “We are initially working with Google and Yahoo! to significantly improve search of this rich content on the Web, and we intend to broaden the availability of this capability to benefit all content publishers, developers and end users.”

Google has already begun to roll out Adobe Flash Player technology incorporated into its search engine. With Adobe’s help, Google can now better read the content on sites that use Adobe Flash technology, helping users find more relevant information when conducting searches. As a result, millions of pre-existing RIAs and dynamic Web experiences that utilize Adobe Flash technology, including content that loads at runtime, are immediately searchable without the need for companies and developers to alter them.

“Google has been working hard to improve how we can read and discover SWF files,” said Bill Coughran, senior vice president of engineering at Google. “Through our recent collaboration with Adobe, we now help Web site owners that choose to design sites with Adobe Flash software by indexing this content better. Improving how we crawl dynamic content will ultimately enhance the search experience for our users.”

Yahoo! also expects to deliver improved Web search capabilities for SWF applications in a future update to Yahoo! Search. “Yahoo! is committed to supporting webmaster needs with plans to support searchable SWF and is working with Adobe to determine the best possible implementation,” said Sean Suchter, vice president Yahoo! Search Technology Engineering.

“Designers and Web developers have long been frustrated that search engines couldn’t better access the information within their content created with Flash technology. It’s great to see Adobe and the search engines working directly together to improve the situation,” said Danny Sullivan, editor-in-chief, SearchEngineLand.com. “The changes should help unlock information that’s previously been ‘invisible’ and will likely result in a better experience for searchers.”

Adobe Flash Player is the world’s most pervasive client runtime that delivers unparalleled creative options, highly engaging user experiences, stunning audio/video playback, and nearly universal reach.

Digg this
Add to Sphinn

Google Improved Flash Indexing

Words from Google

We've received numerous requests to improve our indexing of Adobe Flash files. Today, Ron Adler and Janis Stipins—software engineers on our indexing team—will provide us with more in-depth information about our recent announcement that we've greatly improved our ability to index Flash.

Q: Which Flash files can Google better index now?
We've improved our ability to index textual content in SWF files of all kinds. This includes Flash "gadgets" such as buttons or menus, self-contained Flash websites, and everything in between.

Q: What content can Google better index from these Flash files?
All of the text that users can see as they interact with your Flash file. If your website contains Flash, the textual content in your Flash files can be used when Google generates a snippet for your website. Also, the words that appear in your Flash files can be used to match query terms in Google searches.

In addition to finding and indexing the textual content in Flash files, we're also discovering URLs that appear in Flash files, and feeding them into our crawling pipeline—just like we do with URLs that appear in non-Flash webpages. For example, if your Flash application contains links to pages inside your website, Google may now be better able to discover and crawl more of your website.

Q: What about non-textual content, such as images?
At present, we are only discovering and indexing textual content in Flash files. If your Flash files only include images, we will not recognize or index any text that may appear in those images. Similarly, we do not generate any anchor text for Flash buttons which target some URL, but which have no associated text.

Also note that we do not index FLV files, such as the videos that play on YouTube, because these files contain no text elements.

Q: How does Google "see" the contents of a Flash file?
We've developed an algorithm that explores Flash files in the same way that a person would, by clicking buttons, entering input, and so on. Our algorithm remembers all of the text that it encounters along the way, and that content is then available to be indexed. We can't tell you all of the proprietary details, but we can tell you that the algorithm's effectiveness was improved by utilizing Adobe's new Searchable SWF library.

Q: What do I need to do to get Google to index the text in my Flash files?
Basically, you don't need to do anything. The improvements that we have made do not require any special action on the part of web designers or webmasters. If you have Flash content on your website, we will automatically begin to index it, up to the limits of our current technical ability (see next question).

That said, you should be aware that Google is now able to see the text that appears to visitors of your website. If you prefer Google to ignore your less informative content, such as a "copyright" or "loading" message, consider replacing the text within an image, which will make it effectively invisible to us.

Q: What are the current technical limitations of Google's ability to index Flash?
There are three main limitations at present, and we are already working on resolving them:

1. Googlebot does not execute some types of JavaScript. So if your web page loads a Flash file via JavaScript, Google may not be aware of that Flash file, in which case it will not be indexed.

2. We currently do not attach content from external resources that are loaded by your Flash files. If your Flash file loads an HTML file, an XML file, another SWF file, etc., Google will separately index that resource, but it will not yet be considered to be part of the content in your Flash file.

3. While we are able to index Flash in almost all of the languages found on the web, currently there are difficulties with Flash content written in bidirectional languages. Until this is fixed, we will be unable to index Hebrew language or Arabic language content from Flash files.

We're already making progress on these issues, so stay tuned!

Digg this
Add to Sphinn

Google learns to crawl Flash

Google has been developing a new algorithm for indexing textual content in Flash files of all kinds, from Flash menus, buttons and banners, to self-contained Flash websites. Recently, we've improved the performance of this Flash indexing algorithm by integrating Adobe's Flash Player technology.

In the past, web designers faced challenges if they chose to develop a site in Flash because the content they included was not indexable by search engines. They needed to make extra effort to ensure that their content was also presented in another way that search engines could find.

Now that we've launched our Flash indexing algorithm, web designers can expect improved visibility of their published Flash content, and you can expect to see better search results and snippets. There's more info on the Webmaster Central blog about the Searchable SWF integration.

Digg this
Add to Sphinn

Keyword Research : Where to Start?

Keyword research is the first thing you do when planning a website. You do it before thinking over the site architecture and creating the content. Keyword research is the what you do to estimate your competition and create a promotion strategy.

But where to start? How to find the correct core terms that you will then explore and expand? What you can do first:

1. Make a list of most essential aspects of your business;
2. Think what your visitors or customers would be most interested to see;
3. Learn what your competitors use (but don’t copy their strategy - sort out only most essential and useful terms);
4. Explore your niche essential vocabulary - check out dictionaries and glossaries.

While the first three are more or less clear, number 4 needs further information, I guess. I have a collection of my favorite sources I check regularly to get help with keyword research and content creation. And I am going to share my collection now:

The free dictionary (available with the FireFox search plugin) list a wealth of useful information for any term:

  • your keyword definition(s)
  • set phrases/ idioms (use them to lighten up your writing style)
  • examples of usage
  • words of the same root (inflected words and derivations)
  • thesaurus
  • classic literature mentions
  • keyword phrases
  • related acronyms

Wordnet will provide you with the definitions and sample sentences using your key term.

Rhyme Zone will define your term and find:

  • rhymes (can be real fun to play with by the way)
  • synonyms
  • antonyms

Synonym.com (as the name suggests) will show you the word synonyms and antonyms. The dictionary database is really not bad but they really have to work on usability and disturbing pop-ups the keep popping in.

Cambridge online dictionary will list most essential definitions and keyword phrases.

Meta Glossary finds the word definitions, related words and synonyms

Reference.com (with the FireFox search plugin) shows your core term synonyms and definitions.

Merriam-Webster dictionary lists definitions and derived terms.

Urban Dictionary provides related terms, definitions and examples created by users (FireFox search plugin available).

Digg this
Add to Sphinn