The Data Mining Application Based on WEKA: Geographical Original of Music
Abstract
In this article, we use Weka as our data mining tool. In the first step, we retrieve the dataset from the UCI database and define the purpose of the analysis. We then examine the dataset and identify its class attributes, of which there are two: latitude and longitude. In the second step, we clean the data using Microsoft Excel, Google Maps, and Weka, then address aggregation and skewed data, and finally select the appropriate attributes of the dataset. The third step is the experiment design, for which we choose three classifiers: Naive Bayes, J48, and IBk. In the fourth step, we obtain the experimental results from these classifiers and summarize them. In the fifth step, we plot the ROC curve. In the sixth step, we analyze the final results from three aspects: classifier analysis, attribute analysis, and noise analysis. In the seventh step, we reach the final conclusion that IB1 is the most successful model for our dataset.
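To illustrate the kind of instance-based learner the abstract reports as most successful, the sketch below implements a plain nearest-neighbour classifier in Python. It is not the article's Weka workflow: the function names, the two-feature toy data, and the labels `region_a` and `region_b` are all hypothetical, and leave-one-out evaluation is chosen here only for brevity, since the abstract does not state how the models were evaluated.

```python
import math


def nearest_neighbor_predict(train, query):
    """Return the label of the training instance closest to `query`.

    `train` is a list of (features, label) pairs; distance is Euclidean.
    """
    best_label, best_dist = None, math.inf
    for features, label in train:
        dist = math.dist(features, query)
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def leave_one_out_accuracy(data):
    """Classify each instance using all the others and report the hit rate."""
    correct = 0
    for i, (features, label) in enumerate(data):
        rest = data[:i] + data[i + 1:]
        if nearest_neighbor_predict(rest, features) == label:
            correct += 1
    return correct / len(data)


# Hypothetical toy data: two made-up audio features per track and a
# made-up region label. These are NOT values from the UCI dataset.
TOY_DATA = [
    ((0.10, 0.20), "region_a"),
    ((0.15, 0.25), "region_a"),
    ((0.20, 0.10), "region_a"),
    ((0.90, 0.80), "region_b"),
    ((0.85, 0.95), "region_b"),
    ((0.80, 0.90), "region_b"),
]

if __name__ == "__main__":
    print(nearest_neighbor_predict(TOY_DATA, (0.12, 0.22)))
    print(leave_one_out_accuracy(TOY_DATA))
```

With a single neighbour, the prediction depends entirely on the closest stored instance, which makes this family of models sensitive to noisy instances and to the scale of each attribute.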
DOI: http://dx.doi.org/10.3968/n
Copyright (c) 2017 Yuchen WU
This work is licensed under a Creative Commons Attribution 4.0 International License.
Articles published in Management Science and Engineering are licensed under Creative Commons Attribution 4.0 (CC-BY).