New Page 1




Selettore risorse   



                                         IL Metodo  |  Grammatica  |  RISPOSTE GRAMMATICALI  |  Multiblog  |  INSEGNARE AGLI ADULTI  |  INSEGNARE AI BAMBINI  |  AudioBooks  |  RISORSE SFiziosE  |  Articoli  |  Tips  | testi pAralleli  |  VIDEO SOTTOTITOLATI
                                                                                         ESERCIZI :   Serie 1 - 2 - 3  - 4 - 5  SERVIZI:   Pronunciatore di inglese - Dizionario - Convertitore IPA/UK - IPA/US - Convertitore di valute in lire ed euro                                              




- Great Painters
- Accounting
- Fundamentals of Law
- Marketing
- Shorthand
- Concept Cars
- Videogames
- The World of Sports

- Blogs
- Free Software
- Google
- My Computer

- PHP Language and Applications
- Wikipedia
- Windows Vista

- Education
- Masterpieces of English Literature
- American English

- English Dictionaries
- The English Language

- Medical Emergencies
- The Theory of Memory
- The Beatles
- Dances
- Microphones
- Musical Notation
- Music Instruments
- Batteries
- Nanotechnology
- Cosmetics
- Diets
- Vegetarianism and Veganism
- Christmas Traditions
- Animals

- Fruits And Vegetables


  1. Atom
  2. Audioblogging
  3. Blog Carnival
  4. Blogcast
  5. Blog feed
  6. Blog fiction
  8. Bloggies
  9. Blogosphere
  10. Blogroll
  11. Blog software
  12. Citizen journalism
  13. Collaborative blog
  14. Community Server
  15. Content Management System
  16. Corporate blog
  17. Dooce
  18. Edublog
  19. Electronic literature
  20. Escribitionist
  21. Facebook
  22. Flaming
  23. Forum moderator
  24. Fotolog
  25. GNU General Public License
  26. Google bomb
  27. Google Reader
  28. Inauthentic Text
  29. International Weblogger's Day
  30. Internet Troll
  31. Linkback
  32. Link rot
  33. List of blogging terms
  34. LiveJournal
  35. Massively distributed collaboration
  36. Micropatronage
  37. Moblog
  38. Moderation system
  39. Movable Type
  40. MySpace
  41. MySQL
  42. News aggregator
  43. Online diary
  44. OPML
  45. PageRank
  46. Permalink
  47. Personal journal
  48. Photoblog
  49. Pingback
  50. Ping-server
  51. Podcasting
  52. Political blog
  53. Project blog
  54. Rating community
  55. Reputation management
  56. Reputation system
  57. RSS
  58. Social media
  59. Spam blog
  60. Spamdexing
  61. Spam in blogs
  62. Sping
  63. Technorati
  64. TrackBack
  65. User generated content
  66. Virtual Community
  67. Vlog
  68. Weblog
  69. Windows Live Spaces
  71. Wordpress
  72. Yahoo 360°
  73. YouTube



L'utente può utilizzare il nostro sito solo se comprende e accetta quanto segue:

  • Le risorse linguistiche gratuite presentate in questo sito si possono utilizzare esclusivamente per uso personale e non commerciale con tassativa esclusione di ogni condivisione comunque effettuata. Tutti i diritti sono riservati. La riproduzione anche parziale è vietata senza autorizzazione scritta.
  • Il nome del sito EnglishGratis è esclusivamente un marchio e un nome di dominio internet che fa riferimento alla disponibilità sul sito di un numero molto elevato di risorse gratuite e non implica dunque alcuna promessa di gratuità relativamente a prodotti e servizi nostri o di terze parti pubblicizzati a mezzo banner e link, o contrassegnati chiaramente come prodotti a pagamento (anche ma non solo con la menzione "Annuncio pubblicitario"), o comunque menzionati nelle pagine del sito ma non disponibili sulle pagine pubbliche, non protette da password, del sito stesso.
  • La pubblicità di terze parti è in questo momento affidata al servizio Google AdSense che sceglie secondo automatismi di carattere algoritmico gli annunci di terze parti che compariranno sul nostro sito e sui quali non abbiamo alcun modo di influire. Non siamo quindi responsabili del contenuto di questi annunci e delle eventuali affermazioni o promesse che in essi vengono fatte!
  • L'utente, inoltre, accetta di tenerci indenni da qualsiasi tipo di responsabilità per l'uso - ed eventuali conseguenze di esso - degli esercizi e delle informazioni linguistiche e grammaticali contenute sul siti. Le risposte grammaticali sono infatti improntate ad un criterio di praticità e pragmaticità più che ad una completezza ed esaustività che finirebbe per frastornare, per l'eccesso di informazione fornita, il nostro utente. La segnalazione di eventuali errori è gradita e darà luogo ad una immediata rettifica.


    ENGLISHGRATIS.COM è un sito personale di
    Roberto Casiraghi e Crystal Jones
    email: robertocasiraghi at iol punto it

    Roberto Casiraghi           
    INFORMATIVA SULLA PRIVACY              Crystal Jones

    Siti amici:  Lonweb Daisy Stories English4Life Scuolitalia
    Sito segnalato da INGLESE.IT


This article is from:

All text is available under the terms of the GNU Free Documentation License: 

Spam in blogs

From Wikipedia, the free encyclopedia


Spam in blogs (also called simply blog spam or comment spam) is a form of spamdexing. It is done by automatically posting random comments, promoting commercial services, to blogs, wikis, guestbooks, or other publicly accessible online discussion boards. Any web application that accepts and displays hyperlinks submitted by visitors may be a target.

Adding links that point to the spammer's web site artificially increases the site's search engine ranking. An increased ranking often results in the spammer's commercial site being listed ahead of other sites for certain searches, increasing the number of potential visitors and paying customers.


This type of spam originally appeared in internet guestbooks, where spammers repeatedly fill a guestbook with links to their own site and no relevant comment to increase search engine rankings. If an actual comment is given it is often just "cool page", "nice website", or keywords of the spammed link.

In 2003, spammers began to take advantage of the open nature of comments in the blogging software like Movable Type by repeatedly placing comments to various blog posts that provided nothing more than a link to the spammer's commercial web site. Jay Allen created a free plugin, called MT-BlackList,[1] for the Movable Type weblog tool (versions prior to 3.2) that attempted to alleviate this problem. Many current blogging packages now have methods of preventing or reducing the effect of blog spam, but spammers become smarter as well. Many of them use special blog spamming tools like Trackback Submitter to bypass comment spam protection on popular blogging systems like Movable Type, Wordpress and others.

Possible solutions

Blocking by keyword

This is simplest form of blocking, which yields very good results, because comment spam is targeted at bots, so it must be readable by simple software. A lot of spam can be blocked by banning names of popular pharmaceuticals and casino games.


In early 2005, Google announced that hyperlinks with rel="nofollow" attribute[2] would not influence the link target's ranking in the search engine's index. The Yahoo and MSN search engines also respect this tag. [3]

nofollow is a misnomer in this case since it actually tells a search engine "Don't score this link" rather than "Don't follow this link." This differs from the meaning of nofollow as used within a robots meta tag, which does tell a search engine: "Do not follow any of the hyperlinks in the body of this document."

Using rel="nofollow" is a much easier solution that makes the improvised techniques above irrelevant. Most weblog software now marks reader-submitted links this way by default (with no option to disable it without code modification). A more sophisticated server software could spare the nofollow for links submitted by trusted users like those registered for a long time, on a whitelist, or with a high karma. Some server software adds rel="nofollow" to pages that have been recently edited but omits it from stable pages, under the theory that stable pages will have had offending links removed by human editors.

Some weblog authors object to the use of rel="nofollow", arguing, for example,[4] that

  • Link spammers will continue to spam everyone to reach the sites that do not use rel="nofollow"
  • Link spammers will continue to place links for clicking (by surfers), even if those links are ignored by search engines.
  • Google is advocating the use of rel="nofollow" in order to reduce the effect of heavy inter-blog linking on page ranking.

Jeremy Zawodny has stated on his blog [5] that

Worse, nofollow has another, more pernicious effect, which is that it reduces the value of legitimate comments.

Other websites like Slashdot, with high user participation, use improvised nofollow implementations like adding rel="nofollow" only for potentially misbehaving users. Potential spammers posing as users can be determined through various heuristics like age of registered account and other factors. Slashdot also uses the poster's karma as a determinant in attaching a nofollow tag to user submitted links.

rel="nofollow" has come to be regarded as a microformat.

Turing tests

Various methods requiring humans to do spamming by hand have been attempted. A variety of CAPTCHA gateways have been implemented, in an effort to prevent bots from submitting entries. Drawbacks to this are the annoyance it poses for regular users, and the ability of some advanced bots to defeat simple Capatchas most of the time.

Disallowing links in posts

There is neglible gain from spam that does not contain links, so currently all spam posts contain (excessive number of) links. It is safe to require passing turing tests only if post contains links and letting all other posts through.


Instead of displaying a direct hyperlink submitted by a visitor, a web application could display a link to a script on its own website that redirects to the correct URL. This will not prevent all spam since spammers do not always check for link redirection, but effectively prevents against increasing their PageRank, just as rel=nofollow. An added benefit is that the redirection script can count how many people visit external URLs, although it will increase the load on the site.

Redirects should be server-side to avoid accessibility issues related to client-side redirects. This can be done via the .htaccess file in Apache.

Another way of preventing PageRank leakage is to make use of public redirection services such as TinyURL or My-Own.Net. For example,

<a href="" rel="nofollow" >Link</a>

where 'alias_of_target' is the alias of target address.

Services such as offer a public redirection without the need to configure an alias. An example of a link to on POW7 would be:

<a href=""></a>

Distributed approaches

This approach is very new to addressing link spam. One of the shortcomings of link spam filters is that most sites only receive one link from each domain which is running a spam campaign. If the spammer varies IP addresses, there is little to no distiguishable pattern left on the vandalized site. The pattern, however, is left across the thousands of sites that were hit quickly with the same links.

A distributed approach, like the free LinkSleeve,[6] uses XML-RPC to communicate between the various server applications (such as blogs, guestbooks, forums, and wikis) and the filter server, in this case LinkSleeve. The posted data is stripped of urls and each url is checked against recently submitted urls across the web. If a threshold is exceeded, a "reject" response is returned, thus deleting the comment, message, or posting. Otherwise, an "accept" message is sent.

A more robust distributed approach is Akismet, which uses a similar approach to LinkSleeve but uses API keys to assign trust to nodes and also has wider distribution as a result of being bundled with the 2.0 release of WordPress.[7] They claim over 140,000 blogs contributing to their system. Akismet libraries have been implemented for Java, Python, Ruby, and PHP, but its adoption may be hindered by the requirement of an API key and its commercial use restrictions. No such restrictions are in place for Linksleeve.

Application-specific anti-spam methods

Particularly popular software products such as Movable Type and MediaWiki have developed their own custom anti-spam measures, as spammers focus more attention on targeting those platforms. Whitelists and blacklists that prevent certain IPs from posting, or that prevent people from posting content that matches certain filters, are common defenses. More advanced access control lists require various forms of validation before users can contribute anything like linkspam.

The goal in every case is to allow good users to continue to add links to their comments, as that is considered by some to be a valuable aspect of any comments section.

RSS feed monitoring

Some wikis allow you to access an RSS feed of recent changes or comments. If you add that to your news reader and set up a smart search for common spam terms (usually viagra and other drug names) you can quickly identify and remove the offending spam.

Response tokens

Another filter available to webmasters is to add a hidden session token or hash function to their comment form. When the comments are submitted, data stored within the posting such as IP address and time of posting can be compared to the data stored with the session token or hash generated when the user loaded the comment form. Postings that use different IP addresses for loading the comment form and posting the comment form, or postings that took unusually short or long periods of time to compose can be filtered out. This method is particularly effective against spammers who spoof their IP Address in an attempt to conceal their identities.


Some blog software such as Typo allow the blog administrator to only allow comments submitted via Ajax XMLHttpRequests, and discard regular form POST requests. This causes accessibility problems typical to Ajax-only applications.

Although this technique prevents spam so far, it is a form of security by obscurity and will probably be defeated if it becomes popular enough.

Switching off comments

Some bloggers have chosen to turn off comments because of the volume of spam.

See also

  • Social networking spam


  1. ^
  2. ^
  3. ^
  4. ^
  5. ^
  6. ^
  7. ^

External links

  • Latest protection technique against comment spam on Wordpress.
  • Anti-spam Features of MediaWiki
  • Article about latest spamming techniques on Search Engine Journal
  • Six Apart Comment Spam Guide, fairly broad overview from Movable Type's authors.
  • The (Evil) Genius of Comment Spammers, an article on link spam from Wired magazine.
  • Gilad Mishne, David Carmel and Ronny Lempel: Blocking Blog Spam with Language Model Disagreement, PDF. From the First International Workshop on Adversarial Information Retrieval (AIRWeb'05) Chiba, Japan, 2005.
  • A Comprehensive Guide to Protecting Your Blog from Spam - a series of measures you can follow to making your WordPress Blog spamfree
  • Spam Huntress The Norwegian Spam Huntress - Ann Elisabeth
  • Anti Spam Articles. -Anti Spam Articles and lots of information.
  • SecuriTeam Blogs Spam section Intensive technical posting by the Gadi Evron on blog spam techniques and counter-measures.
  • SignedPing An open specification for blog security to combat spam.
Retrieved from ""