Textual Data Mining Applications for Industrial Knowledge Management Solutions

Ur-Rahman, Nadeem

Thesis-2010-UrRahman.pdf (841 kB)

thesis

posted on 2010-06-22, 09:05 authored by Nadeem Ur-Rahman

In recent years knowledge has become an important business resource, and managing it well helps companies remain competitive in industrial environments. The data available in most industrial settings is complex, and many different data formats may be generated to track the progress of projects, whether these concern developing new products or providing better services to customers. Discovering knowledge from multiple databases requires considerable effort, and data mining techniques serve this purpose for structured data formats. If, however, the data is semi-structured or unstructured, data mining and text mining technologies may need to be combined to produce useful results.

This thesis focuses on the discovery of knowledge from semi-structured or unstructured data through the application of textual data mining techniques that automate the classification of textual information into two categories or classes, which can then be used to help manage the knowledge available in multiple data formats. Applications of data mining techniques to discover valuable information and knowledge in the manufacturing or construction industries are explored in a literature review, and the application of text mining techniques to semi-structured or unstructured data is discussed in detail. A novel integration of data and text mining tools is proposed in the form of a framework in which knowledge discovery and its refinement are performed through Clustering and Apriori Association Rule Mining algorithms. Finally, the hypothesis that this methodology achieves better classification accuracies is examined by applying it to case study data in the form of Post Project Review (PPR) reports. The process of discovering useful knowledge, interpreting it and utilising it has been automated to classify the textual data into two classes.
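
As a rough illustration of the Apriori step named in the abstract, the sketch below mines frequent termsets (sets of words that co-occur in at least a minimum number of documents) from a handful of short texts. It is not the thesis implementation: the function name `frequent_termsets`, the `min_support` parameter, the whitespace tokenisation and the sample sentences are all assumptions made for this example, and the clustering and classification stages of the framework are not shown.

```python
from itertools import combinations


def frequent_termsets(docs, min_support):
    """Level-wise Apriori search: return {termset: document count} for every
    termset that occurs in at least `min_support` documents."""
    # Each document becomes a set of lower-cased, whitespace-separated terms.
    transactions = [frozenset(d.lower().split()) for d in docs]
    # Level 1 candidates: every single term seen in the collection.
    candidates = {frozenset([t]) for tx in transactions for t in tx}
    frequent = {}
    k = 1
    while candidates:
        # Count the documents that contain each candidate termset.
        counts = {c: sum(1 for tx in transactions if c <= tx) for c in candidates}
        level = {c: n for c, n in counts.items() if n >= min_support}
        frequent.update(level)
        k += 1
        # Join step: merge frequent (k-1)-termsets into k-termsets.
        joined = {a | b for a, b in combinations(level, 2) if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in joined
            if all(frozenset(s) in level for s in combinations(c, k - 1))
        }
    return frequent


# Hypothetical review-style sentences, invented for this example only.
docs = [
    "late delivery increased cost",
    "late delivery delayed handover",
    "good planning reduced cost",
    "late design change increased cost",
]
termsets = frequent_termsets(docs, min_support=2)
for terms, count in sorted(termsets.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(terms), count)
```

Termsets found this way could serve as candidate features for a two-class classifier. How the thesis refines and uses them is described in the full text, not in this record.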

School

Mechanical, Electrical and Manufacturing Engineering

Publication date

2010

Notes

A Doctoral Thesis. Submitted in partial fulfillment of the requirements for the award of Doctor of Philosophy of Loughborough University.

EThOS Persistent ID

uk.bl.ethos.519965

Keywords

Knowledge discovery; Knowledge management; Data mining; Text mining; Clustering; MKTPKS Termset Mining; Decision trees; K-nearest Neighbouring (KNN); Naïve Bayes; Support Vector Machines (SVMs); Post Project Reviews (PPRs); Mechanical Engineering not elsewhere classified

Licence

CC BY-NC-ND 4.0
