Atricle Dump
#1 in Business Subscribe Email Print

You are here: Home > Computers and Technology > Software > The Basics Of Bayesian Spam Filtering

Tags

  • lowering
  • market
  • probability
  • total scorethe
  • encounter certain
  • other simple

  • Links

  • Discover What Black Cohosh Can Do For Female Health
  • South Florida Commercial Real Estate: Scouting the Best Deal
  • Christmas Planning - 4 Last Minute Items You Don't Want to Miss!
  • Atricle Dump - The Basics Of Bayesian Spam Filtering

    Towering Web Site Traffic Is Always The Reward For Consistency
    The tricks of the trade that will get you towering web site traffic are very similar to those that will work for you at the stock market. Just think about it for a minute, what sort of people make money off Wall Street? Mostly folks who are pretty consistent in what they do.Now, you probably don't know this but consistency is one of the most common p
    ord pairs and phrases
  • HTML code, such as colors
  • Where a particular phrase appears (meta information)

    The Problems With Scoring Content Based Filters

    Though the scoring based spam filters work well, they also encounter certain problems; the normal ones more so than the Bayesian spam filters. These are some of the problems faced:

    • The scoring content based spam filters build a list of characteristics from the spam emails and the good email
      What is an Investor Ready Business Plan
      A Business Plan, as all good entrepreneurs starting out in life should know is the foundation, or rather a springboard, towards the establishment and growth of a new business. A business plan is an essential tool for companies raising capital – and your business plan needs to be Investor Ready.What is an Investor Ready business plan?An investo
      Bayesian spam filtering has become a popular way to distinguish between legitimate emails and illegitimate spam emails, through a process that uses Bayesian statistical methods. It filters emails by classifying documents into categories. Based on the contents of the message in your email, the Bayesian spam filters calculate the probability of the message being a spam. They are much more robust than the normal content based filters, and their anti spam approach hardly has false positives.

      Normally when you receive an email, one look tells you whether the email is a spam or not. To your eyes, there is ‘zero’ probability of a spam looking like a good email. How would it be if spam filters, too, worked in the same way!

      Bayesian Spam Filters

      Bayesian spam filters are what are known as scoring content-based spam filters. They try to work the way your eye does in identifying spam emails, by looking for words and other characteristics that typify spam. Every characteristic typical of spam is assigned a score, and the total spam score for the whole message is computed. Depending on the type of Bayesian spam filter you are using, it may also look for legitimate email characteristics, thereby lowering the total score.

      The basic difference between the Bayesian spam filters and other simple scoring content based spam filters is that the Bayesian spam filters build the list themselves, as against other filters that depend on a manually built list of characteristics.

      You start with a sizable bunch of emails you have identified as spam, and another bunch of good emails. The filters look at both, the legitimate and the spam emails and calculate in what probability various characters appear in them. Bayesian spam filters may look at:

      • The words in the message body
      • The headers (message paths and senders)
      • The word pairs and phrases
      • HTML code, such as colors
      • Where a particular phrase appears (meta information)

      The Problems With Scoring Content Based Filters

      Though the scoring based spam filters work well, they also encounter certain problems; the normal ones more so than the Bayesian spam filters. These are some of the problems faced:

      • The scoring content based spam filters build a list of characteristics from the spam emails and the good emails
        What Does Your Logo Color Say About Your Business?
        And not only your logo, but also your website, your brochure, your business cards and any of your marketing materials for that matter.Yes, colors do matterThey communicate feelings and emotions. They represent ideas and thoughts. So before you create a logo or any other piece of marketing make sure you select the right colors to communicate a
        ves.

        Normally when you receive an email, one look tells you whether the email is a spam or not. To your eyes, there is ‘zero’ probability of a spam looking like a good email. How would it be if spam filters, too, worked in the same way!

        Bayesian Spam Filters

        Bayesian spam filters are what are known as scoring content-based spam filters. They try to work the way your eye does in identifying spam emails, by looking for words and other characteristics that typify spam. Every characteristic typical of spam is assigned a score, and the total spam score for the whole message is computed. Depending on the type of Bayesian spam filter you are using, it may also look for legitimate email characteristics, thereby lowering the total score.

        The basic difference between the Bayesian spam filters and other simple scoring content based spam filters is that the Bayesian spam filters build the list themselves, as against other filters that depend on a manually built list of characteristics.

        You start with a sizable bunch of emails you have identified as spam, and another bunch of good emails. The filters look at both, the legitimate and the spam emails and calculate in what probability various characters appear in them. Bayesian spam filters may look at:

        • The words in the message body
        • The headers (message paths and senders)
        • The word pairs and phrases
        • HTML code, such as colors
        • Where a particular phrase appears (meta information)

        The Problems With Scoring Content Based Filters

        Though the scoring based spam filters work well, they also encounter certain problems; the normal ones more so than the Bayesian spam filters. These are some of the problems faced:

        • The scoring content based spam filters build a list of characteristics from the spam emails and the good email
          Don't Fight the Market
          I was watching a film the other day and one of the comments that really hit home was 'Don't fight the market'.Some of you may know the film 'Rogue Trader', which was based on the story of a futures trader in Singapore. The main character, Nick Leeson, was trying to manipulate the market, to his own advantage. Initially, his plan worked, but as time w
          y spam. Every characteristic typical of spam is assigned a score, and the total spam score for the whole message is computed. Depending on the type of Bayesian spam filter you are using, it may also look for legitimate email characteristics, thereby lowering the total score.

          The basic difference between the Bayesian spam filters and other simple scoring content based spam filters is that the Bayesian spam filters build the list themselves, as against other filters that depend on a manually built list of characteristics.

          You start with a sizable bunch of emails you have identified as spam, and another bunch of good emails. The filters look at both, the legitimate and the spam emails and calculate in what probability various characters appear in them. Bayesian spam filters may look at:

          • The words in the message body
          • The headers (message paths and senders)
          • The word pairs and phrases
          • HTML code, such as colors
          • Where a particular phrase appears (meta information)

          The Problems With Scoring Content Based Filters

          Though the scoring based spam filters work well, they also encounter certain problems; the normal ones more so than the Bayesian spam filters. These are some of the problems faced:

          • The scoring content based spam filters build a list of characteristics from the spam emails and the good email
            The Power of PAUSE in Public Speaking
            When writing a speech, the general estimate for "words per minute" is 120 to 150. But when I'm hired as a speech writer, I provide 150. That's because a lot of people believe that "more words means more value," and when they practice their speeches, 150 almost feels like not enough.That's because they're racing, and it's among the worst things they c
            ters build the list themselves, as against other filters that depend on a manually built list of characteristics.

            You start with a sizable bunch of emails you have identified as spam, and another bunch of good emails. The filters look at both, the legitimate and the spam emails and calculate in what probability various characters appear in them. Bayesian spam filters may look at:

            • The words in the message body
            • The headers (message paths and senders)
            • The word pairs and phrases
            • HTML code, such as colors
            • Where a particular phrase appears (meta information)

            The Problems With Scoring Content Based Filters

            Though the scoring based spam filters work well, they also encounter certain problems; the normal ones more so than the Bayesian spam filters. These are some of the problems faced:

            • The scoring content based spam filters build a list of characteristics from the spam emails and the good email
              Choose The Right Keywords For Your Site To Make More Money Now
              Many recognize that search engines can bring volumes of highly targeted prospects to their website, typically at a fraction of the cost of traditional marketing.Unfortunately, these same people often overlook the most important part of their search engine marketing campaigns, which is keyphrase selection and evaluation.Keyphrases (those phrase
              ord pairs and phrases
            • HTML code, such as colors
            • Where a particular phrase appears (meta information)

            The Problems With Scoring Content Based Filters

            Though the scoring based spam filters work well, they also encounter certain problems; the normal ones more so than the Bayesian spam filters. These are some of the problems faced:

            • The scoring content based spam filters build a list of characteristics from the spam emails and the good emails they get. For building a good list of spam characteristics, mail needs to be collected from hundreds of sources (email addresses). This may weaken the efficiency of the spam filters, as the characteristics of the good email would be different for each person.

            • If the spammers make an effort to make their mails look like genuine mails, the filtering characteristics may have to be corrected manually - a very big effort.

  • HTTP = HTML link (for blogs, profiles,phorums):
    <a href="http://www.articledump.net/article/178645/articledump-The-Basics-Of-Bayesian-Spam-Filtering.html">The Basics Of Bayesian Spam Filtering</a>

    BB link (for phorums):
    [url=http://www.articledump.net/article/178645/articledump-The-Basics-Of-Bayesian-Spam-Filtering.html]The Basics Of Bayesian Spam Filtering[/url]

    Related Articles:

    Joint Venture Marketing on the Internet - Increase Traffic & Make Money the Easy Way

    The Importance of Life Insurance

    Whole Life Insurance Online Quote – You Do Have the Time!

    Bookmark it: del.icio.us digg.com reddit.com netvouz.com google.com yahoo.com technorati.com furl.net bloglines.com socialdust.com ma.gnolia.com newsvine.com slashdot.org simpy.com shadows.com blinklist.com