Academic Year |
2024Year |
School/Graduate School |
School of Informatics and Data Science |
Lecture Code |
KA215001 |
Subject Classification |
Specialized Education |
Subject Name |
データマイニング |
Subject Name (Katakana) |
データマイニング |
Subject Name in English |
Data Mining |
Instructor |
MORIMOTO YASUHIKO |
Instructor (Katakana) |
モリモト ヤスヒコ |
Campus |
Higashi-Hiroshima |
Semester/Term |
3rd-Year, First Semester, 1Term |
Days, Periods, and Classrooms |
(1T) Thur1-4:ENG 103 |
Lesson Style |
Lecture |
Lesson Style (More Details) |
|
Lecture and related exercises by using a computer 【Online/ondemand style class by using Moodle and Teams】 |
Credits |
2.0 |
Class Hours/Week |
|
Language of Instruction |
B
:
Japanese/English |
Course Level |
2
:
Undergraduate Low-Intermediate
|
Course Area(Area) |
25
:
Science and Technology |
Course Area(Discipline) |
02
:
Information Science |
Eligible Students |
Senior Level |
Keywords |
Knowledge Discovery, Information Retrieval, Discovery Science, Large-scale data processing, Big Data, RESAS (Regional Economy Society Analyzing System) |
Special Subject for Teacher Education |
|
Special Subject |
|
Class Status within Educational Program (Applicable only to targeted subjects for undergraduate students) | |
---|
Criterion referenced Evaluation (Applicable only to targeted subjects for undergraduate students) | Integrated Arts and Sciences (Knowledge and Understanding) ・Knowledge and understanding of the importance and characteristics of each discipline and basic theoretical framework. (Abilities and Skills) ・The ability and skills to collect and analyze necessary literature or data among various sources of information on individual academic disciplines. ・The ability and skills to specify necessary theories and methods for consideration of issues.
Computer Science Program (Abilities and Skills) ・A. Information infrastructure development technology, information processing technology, technology that analyzes data and creates new added value.
Data Science Program (Knowledge and Understanding) ・D1. Knowledge and ability to understand the theoretical framework of statistics and data analysis and to analyze qualitative/quantitative information of big data accurately and efficiently. (Abilities and Skills) ・A. Information infrastructure development technology, information processing technology, technology that analyzes data and creates new added value.
Intelligence Science Program (Knowledge and Understanding) ・D1. A deep systematic understanding of the advanced intelligence of human beings and its realization by computers. (Abilities and Skills) ・A. Information infrastructure development technology, information processing technology, technology that analyzes data and creates new added value. |
Class Objectives /Class Outline |
- To understand how to retrieve necessary information from stored databases efficiently and effectively - To understand what is valuable information and how to extract valuable information - To understand important issues for handling large-scale data |
Class Schedule |
1-2: Guidance and recent data innovations Overview of data mining technology and introduction of recent research results and case studies 3-4: Database management system Basics of database management systems (relational algebra, SQL query language, consistency constraints, transactions, exclusive control, etc.) 5-6: Multidimensional analysis Many facts can be understood by aggregating databases from various directions and analyzing trends. Here, you will learn and practice aggregation operations necessary for multidimensional analysis using familiar real data. 7-8: Exercise data and data mining tools The exercise uses data from the Regional Economic Analysis System (RESAS) provided by the Ministry of Economy, Trade and Industry and the Secretariat of the Council for the Realization of the Digital Garden City National Vision, Cabinet Secretariat, and analyzes various knowledge for regional revitalization through advanced data analysis. We will excavate it through. WEKA, known as a data mining tool, is used as an analysis tool. 9-10: Association rules Definition of association rules and association rule discovery algorithms (A priori algorithm, FP-Tree, FP-Growth, etc.) 11-12: Prediction model Definition of discriminant rules, their optimization methods, and decision trees/regression trees 13-14: Clustering Define data similarity and group data based on similarity 15: Data mining application Related topics such as familiar application examples such as CRM, recommendation systems, and search engines, NoSQL databases such as key-value stores, and map reduce calculation platforms will be introduced in the 15th lecture and during gaps in the lectures. |
Text/Reference Books,etc. |
Reference Book: Jiawei Han, Micheline Kamber共著「Data Mininig: Concepts and Technologies」(Morgan Kaufmann) |
PC or AV used in Class,etc. |
|
(More Details) |
PowerPoint and Handout |
Learning techniques to be incorporated |
|
Suggestions on Preparation and Review |
Compute the results of sample databases by yourself and confirm the results and how they work. |
Requirements |
|
Grading Method |
We will ask you to submit a practice report. |
Practical Experience |
Experienced
|
Summary of Practical Experience and Class Contents based on it |
The lecturer has been in charge of data mining research and development and product development at a corporate research institute for many years, and has a track record of introducing this technology to securities companies, hospitals, etc. |
Message |
|
Other |
Since this lecture is aimed at information science students who must bring Windows PC, the exercise materials were created for Windows users. If your PC is a Mac, we recommend that you enable Windows to run on your Mac in advance. (The exercise content also works on Mac. However, advanced PC knowledge is required in order to adjust Mac-specific operations. Because there are few TAs, those who are not confident that they can adjust windows exercises for Mac on their own should use Windows on Mac.) |
Please fill in the class improvement questionnaire which is carried out on all classes. Instructors will reflect on your feedback and utilize the information for improving their teaching. |