Copyright (c) 2011 Rexer Analytics All Rights Reserved
2007 Data Miner Survey:
2007 Data Miner Survey:
Thank you for your interest in the First Annual Rexer Analytics Data Miner Survey.
This research examined of the analytic behaviors, needs and preferences of data mining professionals. It was conducted as a service to the data mining community. It was not conducted for, or sponsored by, any third party. Rexer Analytics is committed to freely disseminating our research findings through report summaries, conference presentations, and personal contact. If you would like a copy of our 7 page summary report, please contact us (DataMinerSurvey@RexerAnalytics.com). Summaries of this research were presented at the October 2007 Oracle BIWA Summit and the October 2007 SPSS Directions Conference.
This survey has been conducted annually since 2007. Highlights for each year are available online. Contact us to receive the full summary reports (FREE).
27-item survey of data miners, conducted on-line in early 2007
314 responses from individuals in 35 countries
Regression, decision trees and cluster analysis were the most commonly used algorithms (mean number of algorithms used: 6.8)
Top challenges data miners report are dirty data, data access, and explaining data mining to others
SPSS, SPSS Clementine, and SAS are the three most frequently utilized tools (mean number of tools used: 4.5)
There is increasing interest in the Oracle Data Mining tool, and decreasing interest in C4.5/C5.0/See5
The primary factors data miners consider when selecting an analytic tool are: 1) the dependability and stability of software, 2) the ability to handle large data sets, and 3) data manipulation capabilities
The findings vary somewhat depending on the domain in which the data miner works, the tools used, geography, and several other dimensions