|
Major Google Algorithm Update Expected
|
|
Posted at
14-May-2007 07:49:38 PM
|
|
|
Webmasters all over have noticed variations in google search engine results. Several of them have noticed reshuffling and possible tweaking on Google data centers during the last two weeks or so. Some Websites which were ranking in 1st page on 5th and 6th positions have been dropped to second page. some people have noticed variation in rankngs of various data centers for the same keywords.
Links to this post
|
|
|
|
|
|
|
|
|
|
Email this | Comments
|
|
|
|
|
|
Yahoo Slurp 3.0
|
|
Posted at
14-June-2007 10:55:45 PM
|
|
|
Over the past few weeks, Yahoo has been preparing for the latest version of the Yahoo! Search crawler “Yahoo! Slurp 3.0” with some infrastructure updates, which has led to variance in yahoo crawl behavior. According sources of yahoo “With everything now in place, the rollout has officially begun.” As a result of phase rolled out , over the next several weeks, we may see the following changes 1)The crawlers will start crawling from a different and much smaller set of IP addresses 2) If you're using IP-based recognition for yahoo crawlers, you might see a drop in crawl/coverage from Yahoo! 3) The crawlers will also publish a new user-agent, 'Yahoo! Slurp/3.0.' </li> 4) Existing robots.txt instructions for 'Slurp' or 'Yahoo! Slurp' will work, however, directives specific to 'Slurp/2.0,' will not be recognized by the new crawler. These above stated changes will affect the main Yahoo! Web Search crawlers.
Links to this Post
|
|
|
|
|
|
|
|
|
|
Email this | Comments
|
|
|
|
|
|
Avoid the Use of .exe in URLs!
|
|
Posted at
14-June-2007 10:55:45 PM
|
|
|
|
Google accepts to crawl all file extensions
such as .php, .asp., .html, .htm. However, there are certain extensions that
Google can't index such as .exe.
According to Matt Cutts, “But there are some file
extensions that are mostly binary data, such as .exe, where the vast majority
of the time the data would be meaningless blobs, so there are a few extensions
to avoid. If your files are named example.dll or example.bin and you don’t see
Google crawling pages with that file extension, I’d recommend changing your
file extension to something else.” Links to this Post
|
|
|
|
|
|
|
|
|
|
Email this | Comments
|
|
|
|
|
|
Robots Exclusion Protocol Directives
|
|
Posted at
14-July-2007 10:57:14 PM
|
|
|
Common REP Directives The following list are all the major REP features currently implemented by Google, Microsoft, and Yahoo!.
1.Robots.txt Directives
Directive: Disallow
Impact
: Tells a crawler not to crawl your site or parts of your site -- your
site's robots.txt still needs to be crawled to find this directive, but
the disallowed pages will not be crawled
Use Cases: 'No crawl' pages from a site. This directive in the default syntax prevents specific path(s) of a site from crawling
Directive: Allow Impact
: Tells a crawler the specific pages on your site you want indexed so
you can use this in combination with Disallow. If both Disallow and
Allow clauses apply to a URL, the most specific rule – the longest rule
– applies.
Use Cases: This is useful in particular in
conjunction with Disallow clauses, where a large section of a site is
disallowed, except a small section within it.
Directive: $ Wildcard Support
Impact
: Tells a crawler to match everything from the end of a URL -- large
number of directories without specifying specific pages
Use
Cases: 'No Crawl' files with specific patterns, for e.g., files with
certain file types that always have a certain extension, say '.pdf',
etc.
Directive: * Wildcard Support
Impact : Tells a crawler to match a sequence of characters (available by end of June) Use Cases: 'No Crawl' URLs with certain patterns, for e.g., disallow URLs with session ids or other extraneous parameters, etc.
Directive: Sitemaps Location
Impact : Tells a crawler where it can find your sitemaps.
Use Cases: Point to other locations where feeds exist to point the crawlers to the site's content
2. HTML META Directives
Directive: NOINDEX META Tag
Impact : Tells a crawler not to index a given page
Use Cases: Don't index the page. This allows pages that are crawled to be kept out of the index.
Directive: NOFOLLOW META Tag
Impact : Tells a crawler not to follow a link to other content on a given page
Use
Cases: Prevent publicly writeable areas to be abused by spammers
looking for link credit. By NOFOLLOW, you let the robot know that you
are discounting all outgoing links from this page.
Directive: NOSNIPPET META Tag
Impact : Tells a crawler not to display snippets in the search results for a given page
Use Cases: Present no abstract for the page on Search Results.
Directive: NOARCHIVE / NOCACHE META Tag
Impact : Tells a search engine not to show a "cached" link for a given page
Use Cases: Do not make a copy of the page available to users from the Search Engine cache.
Directive: NOODP META Tag
Impact : Tells a crawler not to use a title and snippet from the Open Directory Project for a given page Use Cases: Do not use the ODP (Open Directory Project) title and abstract for this page in Search."
In addition to the above there are other directives supported only by Google
UNAVAILABLE_AFTER
Meta Tag - Tells a crawler when a page should "expire", i.e., after
which date it should not show up in search results.
NOIMAGEINDEX Meta Tag - Tells a crawler not to index images for a given page in search results.
NOTRANSLATE Meta Tag - Tells a crawler not to translate the content on a page into different languages for search results.
|
|
|
|
|
|
|
|
|
|
Email this | Comments
|
|
|
|
|
|
Google Search Suggestions
|
|
Posted at
14-July-2007 10:57:14 PM
|
|
|
Google is educating users about the 'related search
suggestions' and how it helps users narrow their search and get what they are
looking for.
According to the Google Blog "Google
identifies related search queries by evaluating data from multiple sources. In
related search suggestions, Google provides users with valuable automatically
generated suggestions, that would be most appropriate to their query. While
these related search suggestions are usually shown below the search results,
the algorithm sometimes causes them to display above the search results. Links to this post
|
|
|
|
|
|
|
|
|
|
Email this | Comments
|
|
|
|
|
|
Keep Safe From Search Boxes XSS Exploits!
|
|
Posted at
14-August-2006 10:57:52 PM
|
|
|
|
According to Matt Cutts , small, medium as well as big websites
should be very careful about XSS holes on their websites. These XSS holes can
vary in forms, but one of the most common forms is the 'search boxes'. Clicking
on these 'search boxes' is a big NO! NO!.
In the event, that you may notice a dip in your website's
Google rankings, then you should initiate a few searches that would help you to
find, if anyone has injected spammy or porn content on your website.
For example: If your website is example.com, you can run
a few queries such as [site:example.com porn] or [site:example.com biaxin] or
[site:example.com viagra] to see whether you run across unexpected results. Links to this post
|
|
|
|
|
|
|
|
|
|
Email this | Comments
|
|
|
|
|
|
Google Web Master Tools Presentation
|
|
Posted at
15-September-2007 12:45:47 AM
|
|
|
|
Eric Lander's Blog, has a
presentation that addresses the various tools of Google Webmaster Tools,
their importance and their working from the perspective of SEO (Search Engine Optimization).
This presentation is specifically targeted towards users who are interested in
SEO using Google Webmaster Tools
The main aim of this presentation is to make the users aware
of the Tools present within the Google Webmaster Tools. The most prominent ones
being.
- Diagnostics
- Statistics
- Links
- Sitemaps
- Tools
|
|
|
|
|
|
|
|
|
|
Email this | Comments
|
|
|
|
|