Recent Posts
  Keep Safe From Search Boxes XSS Exploits!
  Google Search Suggestions
  Robots Exclusion Protocol Directives
  Avoid the Use of .exe in URLs!
  Yahoo Slurp 3.0
 
  Blogroll
  Google Offical Blog
  Yahoo Offical Blog
  Mattcutts Blog
  SEO Blog
 
  Recent Posts
  August , 2006
  July , 2007
  June , 2007
  May , 2007
  September , 2007
 
 
Major Google Algorithm Update Expected
Posted at 14-May-2007 07:49:38 PM
 
Webmasters all over have noticed variations in google search engine results. Several of them  have noticed reshuffling and possible tweaking on Google data centers during the last two weeks or so. Some Websites which were ranking in 1st page on 5th and 6th positions have been dropped to second page. some people have noticed variation in rankngs of various data centers for the same keywords.

Links to this post
 
 
fd fd
 

Email this | Comments

 
Yahoo Slurp 3.0
Posted at 14-June-2007 10:55:45 PM
 
Over the past few weeks, Yahoo has been preparing for the latest version of the Yahoo! Search crawler “Yahoo! Slurp 3.0” with some infrastructure updates, which has led to variance in yahoo crawl behavior.
According sources of yahoo “With everything now in place, the rollout has officially begun.” As a result of phase rolled out , over the next several weeks, we may see the following changes
    1)The crawlers will start crawling from a different and much smaller set of IP addresses
2)  If you're using IP-based recognition for yahoo crawlers, you might see a drop in crawl/coverage from Yahoo!
3)   The crawlers will also publish a new user-agent, 'Yahoo! Slurp/3.0.' </li>
4)   Existing robots.txt instructions  for 'Slurp' or 'Yahoo! Slurp' will work, however, directives specific to 'Slurp/2.0,' will not be recognized by the new crawler.
These above stated changes will affect the main Yahoo! Web Search crawlers.

Links to this Post

 
 
fd fd
 

Email this | Comments

 
Avoid the Use of .exe in URLs!
Posted at 14-June-2007 10:55:45 PM
 

 Google accepts to crawl all file extensions such as .php, .asp., .html, .htm. However, there are certain extensions that Google can't index such as .exe.

According to Matt Cutts, “But there are some file extensions that are mostly binary data, such as .exe, where the vast majority of the time the data would be meaningless blobs, so there are a few extensions to avoid. If your files are named example.dll or example.bin and you don’t see Google crawling pages with that file extension, I’d recommend changing your file extension to something else.”

Links to this Post

 
 
fd fd
 

Email this | Comments

 
Robots Exclusion Protocol Directives
Posted at 14-July-2007 10:57:14 PM
 
Common REP Directives
The following list are all the major REP features currently implemented by Google, Microsoft, and Yahoo!.


1.Robots.txt Directives

Directive: Disallow

Impact : Tells a crawler not to crawl your site or parts of your site -- your site's robots.txt still needs to be crawled to find this directive, but the disallowed pages will not be crawled

Use Cases: 'No crawl' pages from a site. This directive in the default syntax prevents specific path(s) of a site from crawling

Directive: Allow
Impact : Tells a crawler the specific pages on your site you want indexed so you can use this in combination with Disallow. If both Disallow and Allow clauses apply to a URL, the most specific rule – the longest rule – applies.

Use Cases: This is useful in particular in conjunction with Disallow clauses, where a large section of a site is disallowed, except a small section within it.


Directive: $ Wildcard Support

Impact : Tells a crawler to match everything from the end of a URL -- large number of directories without specifying specific pages

Use Cases: 'No Crawl' files with specific patterns, for e.g., files with certain file types that always have a certain extension, say '.pdf', etc.

Directive: * Wildcard Support

Impact : Tells a crawler to match a sequence of characters (available by end of June)
Use Cases: 'No Crawl' URLs with certain patterns, for e.g., disallow URLs with session ids or other extraneous parameters, etc.

Directive: Sitemaps Location

Impact : Tells a crawler where it can find your sitemaps.

Use Cases: Point to other locations where feeds exist to point the crawlers to the site's content

2. HTML META Directives

Directive: NOINDEX META Tag

Impact : Tells a crawler not to index a given page

Use Cases: Don't index the page. This allows pages that are crawled to be kept out of the index.

Directive: NOFOLLOW META Tag

Impact : Tells a crawler not to follow a link to other content on a given page

Use Cases: Prevent publicly writeable areas to be abused by spammers looking for link credit. By NOFOLLOW, you let the robot know that you are discounting all outgoing links from this page.

Directive: NOSNIPPET META Tag

Impact : Tells a crawler not to display snippets in the search results for a given page

Use Cases: Present no abstract for the page on Search Results.

Directive: NOARCHIVE / NOCACHE META Tag

Impact : Tells a search engine not to show a "cached" link for a given page

Use Cases: Do not make a copy of the page available to users from the Search Engine cache.

Directive: NOODP META Tag

Impact : Tells a crawler not to use a title and snippet from the Open Directory Project for a given page
Use Cases: Do not use the ODP (Open Directory Project) title and abstract for this page in Search."

In addition to the above there are other directives supported only by Google

UNAVAILABLE_AFTER Meta Tag - Tells a crawler when a page should "expire", i.e., after which date it should not show up in search results.

NOIMAGEINDEX Meta Tag - Tells a crawler not to index images for a given page in search results.

NOTRANSLATE Meta Tag - Tells a crawler not to translate the content on a page into different languages for search results.
 
 
fd fd
 

Email this | Comments

 
Google Search Suggestions
Posted at 14-July-2007 10:57:14 PM
 

Google is educating users about the 'related search suggestions' and how it helps users narrow their search and get what they are looking for.

According to the Google Blog  "Google identifies related search queries by evaluating data from multiple sources. In related search suggestions, Google provides users with valuable automatically generated suggestions, that would be most appropriate to their query. While these related search suggestions are usually shown below the  search results, the algorithm sometimes causes them to display above the search results.

Links to this post



 
 
fd fd
 

Email this | Comments

 
Keep Safe From Search Boxes XSS Exploits!
Posted at 14-August-2006 10:57:52 PM
 

According to Matt Cutts , small, medium as well as big websites should be very careful about XSS holes on their websites. These XSS holes can vary in forms, but one of the most common forms is the 'search boxes'. Clicking on these 'search boxes' is a big NO! NO!.

In the event, that you may notice a dip in your website's Google rankings, then you should initiate a few searches that would help you to find, if anyone has injected spammy or porn content on your website.

For example: If your website is example.com, you can run a few queries such as [site:example.com porn] or [site:example.com biaxin] or [site:example.com viagra] to see whether you run across unexpected results.

 Links to this post

 
 
fd fd
 

Email this | Comments

 
Google Web Master Tools Presentation
Posted at 15-September-2007 12:45:47 AM
 

Eric Lander's Blog, has a presentation that addresses the various tools of Google Webmaster Tools, their importance and their working from the perspective of SEO (Search Engine Optimization). This presentation is specifically targeted towards users who are interested in SEO using Google Webmaster Tools

The main aim of this presentation is to make the users aware of the Tools present within the Google Webmaster Tools. The most prominent ones being.

  1. Diagnostics
  2. Statistics
  3. Links
  4. Sitemaps
  5. Tools 
 
 
fd fd
 

Email this | Comments