Copyright (c) 2011 Rexer Analytics All Rights Reserved
2007 Data Miner Survey:
2007 Data Miner Survey:  

Thank you for your interest in the First Annual Rexer Analytics Data Miner Survey.  

This research examined of the analytic behaviors, needs and preferences of data
mining professionals.  It was conducted as a service to the data mining community.  
It was not conducted for, or sponsored by, any third party.  Rexer Analytics is
committed to freely disseminating our research findings through report summaries,
conference presentations, and personal contact.  If you would like a copy of our 7
page summary report, please contact us (
DataMinerSurvey@RexerAnalytics.com).  
Summaries of this research were presented at the October 2007 Oracle BIWA
Summit and the October 2007 SPSS Directions Conference.  



























This survey has been conducted since 2007.  Highlights for each year are available
online.  
Contact us to receive the full summary reports (FREE).
2007 HIGHLIGHTS:

  • 27-item survey of data miners, conducted on-line in early 2007

  • 314 responses from individuals in 35 countries

  • Regression, decision trees and cluster analysis were the most commonly
    used algorithms (mean number of algorithms used: 6.8)

  • Top challenges data miners report are dirty data, data access, and explaining
    data mining to others

  • SPSS, SPSS Clementine, and SAS are the three most frequently utilized tools
    (mean number of tools used: 4.5)

  • There is increasing interest in the Oracle Data Mining tool, and decreasing
    interest in C4.5/C5.0/See5

  • The primary factors data miners consider when selecting an analytic tool are:
    1) the dependability and stability of software, 2) the ability to handle large
    data sets, and 3) data manipulation capabilities

  • The findings vary somewhat depending on the domain in which the data
    miner works, the tools used, geography, and several other dimensions