Data Mining and Data VisualizationElsevier, 2005. gada 2. maijs - 800 lappuses Data Mining and Data Visualization focuses on dealing with large-scale data, a field commonly referred to as data mining. The book is divided into three sections. The first deals with an introduction to statistical aspects of data mining and machine learning and includes applications to text analysis, computer intrusion detection, and hiding of information in digital files. The second section focuses on a variety of statistical methodologies that have proven to be effective in data mining applications. These include clustering, classification, multivariate density estimation, tree-based methods, pattern recognition, outlier detection, genetic algorithms, and dimensionality reduction. The third section focuses on data visualization and covers issues of visualization of high-dimensional data, novel graphical techniques with a focus on human factors, interactive graphics, and data visualization using virtual reality. This book represents a thorough cross section of internationally renowned thinkers who are inventing methods for dealing with a new data paradigm.
|
Saturs
1 | |
47 | |
77 | |
109 | |
5 Text Data Mining with Minimal Spanning Trees | 133 |
Steganography and Steganalysis | 171 |
7 Canonical Variate Analysis and Related Methods for Reduction of Dimensionality and Graphical Representation | 189 |
8 Pattern Recognition | 213 |
12 Fast Algorithms for Classification Using Class Cover Catch Digraphs | 331 |
13 On Genetic Algorithms and their Applications | 359 |
14 Computational Methods for HighDimensional Rotations in Data Visualization | 391 |
15 Some Recent Graphics Templates and Software for Showing Statistical Summaries | 415 |
the Paradigm of Linked Views | 437 |
17 Data Visualization and Virtual Reality | 539 |
back matter | 565 |
index | 609 |
9 Multidimensional Density Estimation | 229 |
10 Multivariate Outlier Detection and Robustness | 263 |
11 Classification and Regression Trees Bagging and Boosting | 303 |
Contents of Previous Volumes | 619 |
Citi izdevumi - Skatīt visu
Data Mining and Data Visualization Calyampudi Radhakrishna Rao,Edward J. Wegman,Jeffrey L. Solka Priekšskatījums nav pieejams - 2005 |
Bieži izmantoti vārdi un frāzes
applications approach attributes bar chart bits boxplots C.R. Rao canonical coordinates classifier clustering color figures section color reproduction corresponding covariance crossover data analysis data mining data set data visualization database defined density estimation detection dimensions distance distribution dominating set example exploratory data analysis frame function genetic algorithms graph graphical elements groups Hellinger distance highlighting histogram IEEE implemented interactive statistical kernel knowledge mining learning sample linear linking LM plots Machine Learning matrix measure methods Michalski misclassification rate mosaic plot multivariate nodes nonparametric objects observations operator optimal Order Statistics outliers packets parallel coordinates parameters pattern pixels prediction principal components problem profiles projection random regression represent robust rotations rule sample population scatterplot selection sequence shows space split statistical graphics steganalysis steganography strings structure subset techniques tion tree values variables vector Wegman
Populāri fragmenti
169. lappuse - The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the Defense Advanced Research Projects Agency or the US Government.
19. lappuse - Equal to != Not equal to > Greater than < Less than >= Greater than or equal to...
80. lappuse - Type of service Total length Identification Flags Fragment offset Time to live Protocol Header checksum Source IP address Destination IP address IP options The version of the IP protocol used to create the datagram.
127. lappuse - A detailed description of fast techniques is beyond the scope of this chapter; the interested reader is referred to [29, 31,32,37,38] for further information.
554. lappuse - A genetic algorithm is an iterative procedure that consists of a constant-size population of individuals, each one represented by a finite string of symbols, known as the genome, encoding a possible solution in a given problem space. This space, referred to as the search space, comprises all possible solutions to the problem at hand.
361. lappuse - Indeed, it is obvious that invention or discovery, be it in mathematics or anywhere else, takes place by combining ideas. Now, there is an extremely great number of such combinations, most of which are devoid of interest, while on the contrary, very few of them can be fruitful.
300. lappuse - Principal component analysis based on robust estimators of the covariance or correlation matrix: Influence functions and efficiencies.
332. lappuse - D — (V, A) where V is a set of vertices, and A is a set of ordered pairs of vertices, corresponding to the directed edges (sometimes called arcs) of the graph.
180. lappuse - Eq. (4) are shown in Figure 4. It can be seen that in the...