DSpace Repository

Monitoring email transaction logs by text-mining email contents

Show simple item record

dc.contributor.advisor Vatcharaporn Esichaikul (Chairperson) en_US
dc.contributor.author Chanawut Juntapoln en_US
dc.contributor.other Guha, Sumanta (Member) en_US
dc.contributor.other Dailey, Matthew N. (Member) en_US
dc.date.accessioned 2015-01-12T10:38:54Z
dc.date.available 2015-01-12T10:38:54Z
dc.date.issued 2011-05 en_US
dc.identifier.other AIT RSPR no.CS-11-03 en_US
dc.identifier.uri http://www.cs.ait.ac.th/xmlui/handle/123456789/159
dc.description Submitted in partial fulfillment of the requirements for the degree of Master of Engineering in Computer science. en_US
dc.description.abstract Email transaction logs can be used to track the unusual events which might appear among message conversations. The problem is that to monitor every single email is taking a lot of effort especially when the size of email transaction log is very large. Therefore, it should be a wise option to keep monitoring only the important emails. This research focuses on email message filtering and email content monitoring which are used to create a prototype system of email monitoring in this study. When the filtering percentage and the query terms are obtained, the email message filtering module finds and selects only important messages based on the number of obtained percentage using DFS (Depth First Search algorithm) together with email scoring model. Then, the email content monitoring module analyzes and scores each of email contents comparing to the obtained query terms using VSM (Vector Space Model) together with WordNet database. Finally, the outcome is a list of ranked messages based on their similarity scores. Time to proceed all the system modules rapidly increased when the number of messages was raised up which means that the filtering module can help to relieve the time-consuming problem especially in a large scale of email transaction logs. In addition, users can manually adjust the most suitable filtering percentage provided by the configuration components. The experiments also showed that enhancing terms by WordNet helped in rising up the precision value of the system by increasing the opportunity to match among similar terms. Furthermore, increasing the number of messages to be analyzed also has significantly raised the recall value of the system. en_US
dc.description.sponsorship Royal Thai Government Fellowship en_US
dc.language.iso eng en_US
dc.publisher Asian Institute of Technology en_US
dc.subject Data mining en_US
dc.subject Electronic mail en_US
dc.subject Vector space model en_US
dc.subject Data filtering en_US
dc.subject.lcsh Others en_US
dc.title Monitoring email transaction logs by text-mining email contents en_US
dc.type Research Report en_US
dc.rights.holder Copyright (C) 2011 by Asian Institute of Technology. en_US

Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace

Advanced Search


My Account