UPH Digital Library Miner: A Topic Modelling-based Software Application for Mining Document Collections of a Digital Library

dc.contributor.author: Olowookere, Toluwase Ayobami
dc.date.accessioned: 2022-05-05T18:41:23Z
dc.date.available: 2022-05-05T18:41:23Z
dc.date.issued: 2015-12
dc.description.abstract: With changing user expectations, many traditional libraries are moving toward digital content storage. Accessible from anywhere at any time, the content stored in digital libraries gives users efficient, on-demand access to information. With this trend, the amount of digital content made available to users, especially digital text documents, has increased tremendously over the years. These documents carry hidden information in the form of the varied topics of discourse inherent in them, which leads to information overload. Users, mostly computational researchers, therefore face challenges in discovering and identifying the variety of topical content in a digital library's collections, making it imperative to develop a means of automatically discovering the topics that pervade those collections. This paper therefore presents UPH Digital Library Miner, a software application for mining the document collections of a digital library for topical structure discovery and topic-based similarity search between collection pairs, using a topic modeling algorithm and an inverted Kullback-Leibler divergence measure. The application is integrated with document collections built in a widely used digital library software system, the Greenstone digital library system, via a loose-coupling integration approach. Results obtained from using the application on Greenstone document collections containing abstracts of about 628 documents from IEEE Transactions on Software Engineering show its ability to discover latent topical structures in collections and to report collections that are similar based on their discovered topical structure. (en_US)
dc.identifier.issn: 0975-8887
dc.identifier.uri: http://dspace.run.edu.ng:8080/jspui/handle/123456789/2706
dc.language.iso: en (en_US)
dc.publisher: International Journal of Computer Applications (en_US)
dc.subject: Digital library (en_US)
dc.subject: Document collection (en_US)
dc.subject: Text mining (en_US)
dc.subject: Topic modeling (en_US)
dc.subject: Topical structure (en_US)
dc.title: UPH Digital Library Miner: A Topic Modelling-based Software Application for Mining Document Collections of a Digital Library (en_US)
dc.type: Article (en_US)
Files
Original bundle
Name: UPH_Digital_Library_Miner_A_Topic_Modelling-based_.pdf
Size: 957.67 KB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission