Ending Spam: Bayesian Content Filtering and the Art of Statistical Language Classification

No Starch Press, 2005 - 312 pages

Through considerable research, creative minds have invented clever new ways to fight spam in all its nefarious forms. This landmark title describes, in depth, how next-generation spam filters use statistical filtering to identify and filter spam. Zdziarski explains how spam filtering works and how language classification and machine learning combine to produce remarkably accurate spam filters. Readers gain a complete understanding of the mathematical approaches used in today's spam filters, of decoding and tokenization, of the use of various algorithms (including Bayesian analysis and Markovian discrimination), and of the benefits of using open-source solutions to end spam. Interviews with the creators of many of the best spam filters provide further insight into the anti-spam crusade.

Contents

PART II FUNDAMENTALS OF STATISTICAL FILTERING

PART III ADVANCED CONCEPTS OF STATISTICAL FILTERING

APPENDIX SHINING EXAMPLES OF FILTERING

Index

Common terms and phrases

able accuracy accurate actually additional algorithms allows amount analysis appear approach attacks attempts Bayesian become better blacklist chained tokens Chapter characters classification combination components concepts considered corpus correct create database dataset decision matrix depending designed determine developer discussed disk effective encoding entire error example functions going guilty headers identify implementation important improve individual innocent inoculation interesting Internet language learning legitimate look machine Markovian measure necessary once original particular percent perform period phrases present probability problem purge receive records require rules sender server significant simply simulation single solution space spam filters spammers specific statistical filters storage Subject tokens types user’s users usually viagra weight

About the author (2005)

Jonathan A. Zdziarski has been fighting spam for eight years and has spent a significant portion of the past two years working on DSPAM, a next-generation spam filter with up to 99.985% accuracy. Zdziarski lectures widely on the topic of spam.

Bibliographic information

Title	Ending Spam: Bayesian Content Filtering and the Art of Statistical Language Classification (No Starch Press Series)
Author	Jonathan A. Zdziarski
Edition	illustrated
Publisher	No Starch Press, 2005
ISBN	1593270526, 9781593270520
Length	312 pages
