Virginia Tech. Digital Library Research LaboratoryVirginia Tech. Department of Computer ScienceChen, LiangzheLin, XiaoWood, AndrewFox, Edward A.Chitturi, KiranKanan, Tarek2015-05-222015-05-222012-11-17http://hdl.handle.net/10919/52539This module introduces algorithms and evaluation metrics for flat clustering. We focus on the usage of LucidWorks big data analysis software and Apache Mahout, an open source machine learning library in clustering of document collections with the k-means algorithm.12 pagesapplication/pdfen-USIn CopyrightComputer scienceDigital librariesText clusteringLucidworksApache mahoutText Clustering Using LucidWorks and Apache MahoutLearning objecthttp://curric.dlib.vt.edu/modDev/lucidworks_modules/CS5604F2012Module-LucidWorks-Clustering.pdf