Advances in Data Analysis (eBook)
XVI, 687 Seiten
Springer Berlin (Verlag)
978-3-540-70981-7 (ISBN)
This book focuses on exploratory data analysis, learning of latent structures in datasets, and unscrambling of knowledge. Coverage details a broad range of methods from multivariate statistics, clustering and classification, visualization and scaling as well as from data and time series analysis. It provides new approaches for information retrieval and data mining and reports a host of challenging applications in various fields.
Preface 6
Contents 9
Clustering 17
Mixture Models for Classification 18
How to Choose the Number of Clusters: The Cramer Multiplicity Solution 30
Model Selection Criteria for Model-Based Clustering of Categorical Time Series Data: A Monte Carlo Study 38
Cluster Quality Indexes for Symbolic Classification – An Examination 46
Semi-Supervised Clustering: Application to Image Segmentation 54
A Method for Analyzing the Asymptotic Behavior of the Walk Process in Restricted Random Walk Cluster Algorithm 66
Cluster and Select Approach to Classifier Fusion 74
Random Intersection Graphs and Classification 82
Optimized Alignment and Visualization of Clustering Results 90
Finding Cliques in Directed Weighted Graphs Using Complex Hermitian Adjacency Matrices 98
Text Clustering with String Kernels in R 106
Automatic Classification of Functional Data with Extremal Information 114
Typicality Degrees and Fuzzy Prototypes for Clustering 122
On Validation of Hierarchical Clustering 130
Classification 138
Rearranging Classified Items in Hierarchies Using Categorization Uncertainty 139
Localized Linear Discriminant Analysis 147
Calibrating Classifier Scores into Probabilities 155
Nonlinear Support Vector Machines Through Iterative Majorization and I- Splines 163
Deriving Consensus Rankings from Benchmarking Experiments 176
Classification of Contradiction Patterns 184
Selecting SVM Kernels and Input Variable Subsets in Credit Scoring Models 192
Data and Time Series Analysis 200
Simultaneous Selection of Variables and Smoothing Parameters in Geoadditive Regression Models 201
Modelling and Analysing Interval Data 209
Testing for Genuine Multimodality in Finite Mixture Models: Application to Linear Regression Models 221
Happy Birthday to You, Mr. Wilcoxon! 229
Equivalent Number of Degrees of Freedom for Neural Networks 241
Model Choice for Panel Spatial Models: Crime Modeling in Japan 249
A Boosting Approach to Generalized Monotonic Regression 257
From Eigenspots to Fisherspots – Latent Spaces in the Nonlinear Detection of Spot Patterns in a Highly Varying Background 267
Identifying and Exploiting Ultrametricity 275
Factor Analysis for Extraction of Structural Components and Prediction in Time Series 285
Classification of the U.S. Business Cycle by Dynamic Linear Discriminant Analysis 293
Examination of Several Results of Different Cluster Analyses with a Separate View to Balancing the Economic and Ecological Performance Potential of Towns and Cities 301
Visualization and Scaling Methods 309
VOS: A New Method for Visualizing Similarities Between Objects 310
Multidimensional Scaling of Asymmetric Proximities with a Dominance Point 318
Single Cluster Visualization to Optimize Air Traffic Management 330
Rescaling Proximity Matrix Using Entropy Analyzed by INDSCAL 337
Information Retrieval, Data and Web Mining 345
Canonical Forms for Frequent Graph Mining 346
Applying Clickstream Data Mining to Real- Time Web Crawler Detection and Containment Using ClickTips Platform 359
Plagiarism Detection Without Reference Collections 367
Putting Successor Variety Stemming to Work 375
Collaborative Filtering Based on User Trends 383
Investigating Unstructured Texts with Latent Semantic Analysis 391
Marketing, Management Science and Economics 399
Heterogeneity in Preferences for Odd Prices 400
Classification of Reference Models 408
Adaptive Conjoint Analysis for Pricing Music Downloads 416
Improving the Probabilistic Modeling of Market Basket Data 424
Classification in Marketing Research by Means of LEM2- generated Rules 432
Pricing Energy in a Multi-Utility Market 440
Disproportionate Samples in Hierarchical Bayes CBC Analysis 448
Building on the Arules Infrastructure for Analyzing Transaction Data with R 456
Balanced Scorecard Simulator – A Tool for Stochastic Business Figures 464
Integration of Customer Value into Revenue Management 472
Women’s Occupational Mobility and Segregation in the Labour Market: Asymmetric Multidimensional Scaling 480
Multilevel Dimensions of Consumer Relationships in the Healthcare Service Market M- L IRT vs. M- L SEM Approach 488
Data Mining in Higher Education 496
Attribute Aware Anonymous Recommender Systems 504
Banking and Finance 512
On the Notions and Properties of Risk and Risk Aversion in the Time Optimal Approach to Decision Making 513
A Model of Rational Choice Among Distributions of Goal Reaching Times 521
On Goal Reaching Time Distributions Estimated from DAX Stock Index Investments 529
Credit Risk of Collaterals: Examining the Systematic Linkage between Insolvencies and Physical Assets in Germany 537
Foreign Exchange Trading with Support Vector Machines 545
The Influence of Specific Information on the Credit Risk Level 553
Bio- and Health Sciences 561
Enhancing Bluejay with Scalability, Genome Comparison and Microarray Visualization 562
Discovering Biomarkers for Myocardial Infarction from SELDI- TOF Spectra 574
Joint Analysis of In-situ Hybridization and Gene Expression Data 582
Unsupervised Decision Trees Structured by Gene Ontology ( GO- UDTs) for the Interpretation of Microarray Data 590
Linguistics and Text Analysis 598
Clustering of Polysemic Words 599
Classifying German Questions According to Ontology- Based Answer Types 607
The Relationship of Word Length and Sentence Length: The Inter- Textual Perspective 615
Comparing the Stability of Different Clustering Results of Dialect Data 623
Part-of-Speech Discovery by Clustering Contextual Features 631
Statistical Musicology and Sound Classification 639
A Probabilistic Framework for Audio-Based Tonal Key and Chord Recognition 640
Using MCMC as a Stochastic Optimization Procedure for Monophonic and Polyphonic Sound 648
Vowel Classification by a Neurophysiologically Parameterized Auditory Model 656
Archaeology 664
Uncovering the Internal Structure of the Roman Brick and Tile Making in Frankfurt- Nied by Cluster Validation 665
Where Did I See You Before... A Holistic Method to Compare and Find Archaeological Artifacts 673
Keywords 683
Author Index 687
Erscheint lt. Verlag | 24.3.2007 |
---|---|
Reihe/Serie | Studies in Classification, Data Analysis, and Knowledge Organization | Studies in Classification, Data Analysis, and Knowledge Organization |
Zusatzinfo | XVI, 687 p. |
Verlagsort | Berlin |
Sprache | englisch |
Themenwelt | Mathematik / Informatik ► Informatik ► Datenbanken |
Mathematik / Informatik ► Mathematik ► Statistik | |
Mathematik / Informatik ► Mathematik ► Wahrscheinlichkeit / Kombinatorik | |
Medizin / Pharmazie ► Allgemeines / Lexika | |
Technik | |
Wirtschaft | |
Schlagworte | Analysis • classification • Clustering • Data Analysis • Data Mining • Information Retrieval • machine learning • Multivariate Statistics • Statistics • Time Series |
ISBN-10 | 3-540-70981-9 / 3540709819 |
ISBN-13 | 978-3-540-70981-7 / 9783540709817 |
Haben Sie eine Frage zum Produkt? |
Größe: 9,6 MB
DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasserzeichen und ist damit für Sie personalisiert. Bei einer missbräuchlichen Weitergabe des eBooks an Dritte ist eine Rückverfolgung an die Quelle möglich.
Dateiformat: PDF (Portable Document Format)
Mit einem festen Seitenlayout eignet sich die PDF besonders für Fachbücher mit Spalten, Tabellen und Abbildungen. Eine PDF kann auf fast allen Geräten angezeigt werden, ist aber für kleine Displays (Smartphone, eReader) nur eingeschränkt geeignet.
Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.
Zusätzliches Feature: Online Lesen
Dieses eBook können Sie zusätzlich zum Download auch online im Webbrowser lesen.
Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.
aus dem Bereich