3rd International Conference of the ERCIM WG on
10-12 December 2010, Senate House, University of London, UK

Robust Analysis of Complex Data Sets


Scientific experiments often generate a large number of measurements. Also in an industrial or business environment, the number of available variables for different products or customers may become huge, due to ever more powerful monitoring systems. Multivariate statistical modeling is typically used to understand better the relationships between different variables, but their use becomes cumbersome if a high number of variables is measured. In this case, the use dimension reduction techniques, becomes appropriate. Another issue is that a traditional multivariate approach is based on over-simplified models, like multivariate normality. The use of robust methods not depending on unrealistic model assumptions is indispensable, and allows extracting features and structures in the data in a reliable way. While robust methods are well established for dealing with simple models, as the regression and location-scale model, there is still work to do for more complicated, multivariate and non-linear models. Since atypical observations are frequently present when analyzing complex data sets, new robust methods need to be introduced. Practical implementation and computational feasibility are of major importance in robust data mining.


This track focuses on methods that are considered as data-mining techniques, including supervised and unsupervised learning. Particular topics for contributions are:

Full papers containing a strong computational or data analytic component will be considered for publication in the second Special Issue of Machine Learning and Robust Data Mining of the journal Computational Statistics and Data Analysis. All submissions must contain original unpublished work not being considered for publication elsewhere. Submissions will be refereed according to standard procedures for CSDA.