Data Mining and Data Visualization

Pirmais vāks
Elsevier, 2005. gada 2. maijs - 800 lappuses
Data Mining and Data Visualization focuses on dealing with large-scale data, a field commonly referred to as data mining. The book is divided into three sections. The first deals with an introduction to statistical aspects of data mining and machine learning and includes applications to text analysis, computer intrusion detection, and hiding of information in digital files. The second section focuses on a variety of statistical methodologies that have proven to be effective in data mining applications. These include clustering, classification, multivariate density estimation, tree-based methods, pattern recognition, outlier detection, genetic algorithms, and dimensionality reduction. The third section focuses on data visualization and covers issues of visualization of high-dimensional data, novel graphical techniques with a focus on human factors, interactive graphics, and data visualization using virtual reality. This book represents a thorough cross section of internationally renowned thinkers who are inventing methods for dealing with a new data paradigm.
  • Distinguished contributors who are international experts in aspects of data mining
  • Includes data mining approaches to non-numerical data mining including text data, Internet traffic data, and geographic data
  • Highly topical discussions reflecting current thinking on contemporary technical issues, e.g. streaming data
  • Discusses taxonomy of dataset sizes, computational complexity, and scalability usually ignored in most discussions
  • Thorough discussion of data visualization issues blending statistical, human factors, and computational insights

No grāmatas satura

Saturs

body
1
2 From Data Mining to Knowledge Mining
47
3 Mining Computer Securitycomputer security Data
77
4 Data Mining of Text Files
109
5 Text Data Mining with Minimal Spanning Trees
133
Steganography and Steganalysis
171
7 Canonical Variate Analysis and Related Methods for Reduction of Dimensionality and Graphical Representation
189
8 Pattern Recognition
213
12 Fast Algorithms for Classification Using Class Cover Catch Digraphs
331
13 On Genetic Algorithms and their Applications
359
14 Computational Methods for HighDimensional Rotations in Data Visualization
391
15 Some Recent Graphics Templates and Software for Showing Statistical Summaries
415
the Paradigm of Linked Views
437
17 Data Visualization and Virtual Reality
539
back matter
565
index
609

9 Multidimensional Density Estimation
229
10 Multivariate Outlier Detection and Robustness
263
11 Classification and Regression Trees Bagging and Boosting
303
Contents of Previous Volumes
619
Autortiesības

Citi izdevumi - Skatīt visu

Bieži izmantoti vārdi un frāzes

Populāri fragmenti

169. lappuse - The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the Defense Advanced Research Projects Agency or the US Government.
19. lappuse - Equal to != Not equal to > Greater than < Less than >= Greater than or equal to...
80. lappuse - Type of service Total length Identification Flags Fragment offset Time to live Protocol Header checksum Source IP address Destination IP address IP options The version of the IP protocol used to create the datagram.
127. lappuse - A detailed description of fast techniques is beyond the scope of this chapter; the interested reader is referred to [29, 31,32,37,38] for further information.
554. lappuse - A genetic algorithm is an iterative procedure that consists of a constant-size population of individuals, each one represented by a finite string of symbols, known as the genome, encoding a possible solution in a given problem space. This space, referred to as the search space, comprises all possible solutions to the problem at hand.
361. lappuse - Indeed, it is obvious that invention or discovery, be it in mathematics or anywhere else, takes place by combining ideas. Now, there is an extremely great number of such combinations, most of which are devoid of interest, while on the contrary, very few of them can be fruitful.
300. lappuse - Principal component analysis based on robust estimators of the covariance or correlation matrix: Influence functions and efficiencies.
332. lappuse - D — (V, A) where V is a set of vertices, and A is a set of ordered pairs of vertices, corresponding to the directed edges (sometimes called arcs) of the graph.
180. lappuse - Eq. (4) are shown in Figure 4. It can be seen that in the...

Par autoru (2005)

Professor C. R. Rao, born in India, is one of this century's foremost statisticians, and received his education in statistics at the Indian Statistical Institute (ISI), Calcutta. He is Emeritus Holder of the Eberly Family Chair in Statistics at Penn State and Director of the Center for Multivariate Analysis. He has long been recognized as one of the world's top statisticians, and has been awarded 34 honorary doctorates from universities in 19 countries spanning 6 continents. His research has influenced not only statistics, but also the physical, social and natural sciences and engineering.

In 2011 he was recipient of the Royal Statistical Society's Guy Medal in Gold which is awarded triennially to those "who are judged to have merited a signal mark of distinction by reason of their innovative contributions to the theory or application of statistics". It can be awarded both to fellows (members) of the Society and to non-fellows. Since its inception 120 years ago the Gold Medal has been awarded to 34 distinguished statisticians. The first medal was awarded to Charles Booth in 1892. Only two statisticians, H. Cramer (Norwegian) and J. Neyman (Polish), outside Great Britain were awarded the Gold medal and C. R. Rao is the first non-European and non-American to receive the award.

Other awards he has received are the Gold Medal of Calcutta University, Wilks Medal of the American Statistical Association, Wilks Army Medal, Guy Medal in Silver of the Royal Statistical Society (UK), Megnadh Saha Medal and Srinivasa Ramanujan Medal of the Indian National Science Academy, J.C.Bose Gold Medal of Bose Institute and Mahalanobis Centenary Gold Medal of the Indian Science Congress, the Bhatnagar award of the Council of Scientific and Industrial Research, India and the Government of India honored him with the second highest civilian award, Padma Vibhushan, for “outstanding contributions to Science and Engineering / Statistics , and also instituted a cash award in honor of C R Rao, “to be given once in two years to a young statistician for work done during the preceding 3 years in any field of statistics .

For his outstanding achievements Rao has been honored with the establishment of an institute named after him, C.R.Rao Advanced Institute for Mathematics, Statistics and Computer Science, in the campus of the University of Hyderabad, India.

Bibliogrāfiskā informācija