Data Mining; Tantangan Baru dalam Era Big Data

Artikel

Bismillah, semester ini saya mengajar di Program Magister Administrasi Publik Unsoed mata kuliah Statistika dan Data Mining.

Mata kuliah ini sebelumnya hanya berfokus pada statistika untuk sektor publik. Namun mulai semester ini, sebagai respon terhadap perkembangan teknologi yang diwarnai oleh fenomena Big Data, maka mata kuliah ini ditambah dengan materi tentang Data Mining.

Data mining bisa kita artikan sebagai penambangan data. Sebagaimana kita rasakan bersama, kita sekarang ini sudah dikelilingi oleh  Big Data yang tertanam dalam berbagai platform media sosial, sistem informasi, dan media digital lainnya. Data mining ini berusaha untuk mengambil data yang banyak tersebar, dengan metode tertentu, kemudian diolah dan dianalisis sehingga dapat diketahui pola dan kecenderungannya. Dari data yang berserakan tanpa pola dan tidak bermanfaat, dengan data mining menjadi bermakna dan bermanfaat untuk berbagai keperluan.

Dalam menambang data, ada beberapa tools yang dapat digunakan baik yang free maupun berbayar. Atas dasar pertimbangan kemampuan mahasiswa, tool yang saya gunakan adalah Rapid Miner, yang bisa didownload secara free, dan kita bisa punya akun versi akademisi.

Berikut tautan untuk mendapatkan softwerenya https://rapidminer.com/get-started/

Buku yang saya pakai sebagai referensi antara lain buku yang berjudul:


Book title: Data Mining for the Masses

Author(s): Dr. Matthew A North

Publisher: Global Text Project, Year: 2012

ISBN: 978-0615684376

Book link


Book title: Predictive Analytics and Data Mining: Concepts and Practice with RapidMiner

Author(s): Vijay Kotu, Bala Deshpande

Publisher: Morgan Kaufmann, Year: 2014

ISBN: 0128014601,9780128014608

Description:
Put Predictive Analytics into ActionLearn the basics of Predictive Analysis and Data Mining through an easy to understand conceptual framework and immediately practice the concepts learned using the open source RapidMiner tool. Whether you are brand new to Data Mining or working on your tenth project, this book will show you how to analyze data, uncover hidden patterns and relationships to aid important decisions and predictions. Data Mining has become an essential tool for any enterprise that collects, stores and processes data as part of its operations. This book is ideal for business users, data analysts, business analysts, business intelligence and data warehousing professionals and for anyone who wants to learn Data Mining. You’ll be able to: 1. Gain the necessary knowledge of different data mining techniques, so that you can select the right technique for a given data problem and create a general purpose analytics process. 2. Get up and running fast with more than two dozen commonly used powerful algorithms for predictive analytics using practical use cases. 3. Implement a simple step-by-step process for predicting an outcome or discovering hidden relationships from the data using RapidMiner, an open source GUI based data mining tool

Book link


Book title: RapidMiner: Data Mining Use Cases and Business Analytics Applications

Author(s): Markus Hofmann, Ralf Klinkenberg

Series: Chapman & Hall/CRC Data Mining and Knowledge Discovery Series

Publisher: Chapman and Hall/CRC, Year: 2013

ISBN: 978-1-4822-0550-3,978-1-4822-0549-7

Description:

Powerful, Flexible Tools for a Data-Driven World
As the data deluge continues in today’s world, the need to master data mining, predictive analytics, and business analytics has never been greater. These techniques and tools provide unprecedented insights into data, enabling better decision making and forecasting, and ultimately the solution of increasingly complex problems.

Learn from the Creators of the RapidMiner Software
Written by leaders in the data mining community, including the developers of the RapidMiner software, RapidMiner: Data Mining Use Cases and Business Analytics Applications provides an in-depth introduction to the application of data mining and business analytics techniques and tools in scientific research, medicine, industry, commerce, and diverse other sectors. It presents the most powerful and flexible open source software solutions: RapidMiner and RapidAnalytics. The software and their extensions can be freely downloaded at www.RapidMiner.com.

Understand Each Stage of the Data Mining Process
The book and software tools cover all relevant steps of the data mining process, from data loading, transformation, integration, aggregation, and visualization to automated feature selection, automated parameter and process optimization, and integration with other tools, such as R packages or your IT infrastructure via web services. The book and software also extensively discuss the analysis of unstructured data, including text and image mining.

Easily Implement Analytics Approaches Using RapidMiner and RapidAnalytics
Each chapter describes an application, how to approach it with data mining methods, and how to implement it with RapidMiner and RapidAnalytics. These application-oriented chapters give you not only the necessary analytics to solve problems and tasks, but also reproducible, step-by-step descriptions of using RapidMiner and RapidAnalytics. The case studies serve as blueprints for your own data mining applications, enabling you to effectively solve similar problems.

Book link