开发者

Prevent site configuration info from showing up on Google

开发者 https://www.devze.com 2023-03-19 02:32 出处:网络
I have a site that\'s running WordPress. 开发者_JAVA百科The main page has an embedded Flash player and an imbedded iframe, and for some reason, all the configuration info from the Flash player is sho

I have a site that's running WordPress.

开发者_JAVA百科The main page has an embedded Flash player and an imbedded iframe, and for some reason, all the configuration info from the Flash player is showing up on Google for my site, and nothing else.

How can I have the main site information show up on Google, without having that Flash player config info show up?

And can I customize what shows up at all?

If there's some way to tag the info I don't want to show up, or tag the info I want to show up, I can probably do most ofthe edits myself, I just don't know where to start...

EDIT: I tried most of the suggestions below, and I didn't get anywhere... Any other ideas?

Thanks a lot!


If you don't want Google, or other crawler to access certain parts of your website you should use a robots.txt file. Inside you specify which parts are accessible and which aren't, when the crawlers get to your website will always look for this file for instructions.

You can check some documentation on how to do it here and here


In order to influence what text is used on the google search result try putting this within your head tags

<meta name="description" content="WHATEVER YOU WANT DISPLAYED ON GOOGLE">

Source: http://static.googleusercontent.com/external_content/untrusted_dlcp/www.google.com/en/us/webmasters/docs/search-engine-optimization-starter-guide.pdf

Some more information from google on controling parts of a page. Apparently there are google off/google on tags.

http://perishablepress.com/press/2009/08/23/tell-google-to-not-index-certain-parts-of-your-page/

Hope this helps.


If you want Google to index only part of your pages, you can't follow normal SEO routines. You should provide a mechanism to understand whether the current client (requester) is a robot or not. If yes, then don't render that part. This is the only way. Otherwise, a robot either gets the whole rendered content, or doesn't have access based on robots.txt file (Robot Exclusion Protocol).

Another way (which is not really smart, and can't be guaranteed to work) is to dynamically inject your content into the page via JavaScript. Because AMAIK, robots don't run JavaScript.


As search spiders won't render javascript generated markup (JS is not run as it is client-side in the browser), a quick fix would be to don't output any of flash / markup initially in the HTML document and then use JS to add the flash stuff on load.

Note: as far as I'm aware, Google is currently testing a JS reading spider so this may not work long term.


  1. Google is returning this data because it simply can't find any content where it normally would. Search engines require content - they're not advanced enough to process your multimedia to determine what it's all about.
  2. Google will IGNORE your meta description if it doesn't feel that it reflects your page content (of which there is only iframes and JS)
  3. Use SWFObject to provide alternate content for users without flash (including search engines) - ensure it's not some dinky text like "download flash here" - but a lengthy descriptive content piece about your site or media that they would normally experience if they could experience.
  4. Use robots.txt or <meta name="robots" content="noindex,follow"> for the iframe content to prevent it from being indexed.
  5. For the love of all things holy, please look at reducing the number of JS files and inline JS on your site (i'd recommend WP-minify since it's so obvious that you love plugins)
0

精彩评论

暂无评论...
验证码 换一张
取 消

关注公众号