Mark Watson’s Artificial Intelligence Books and Blog

Share this post

I open sourced my Java KBtextmaster project

markwatson.com

I open sourced my Java KBtextmaster project

Mark Watson
Mar 7, 2005
Share

KBtextmaster reads a variety of document formats (Word, Powerpoint, PDF, OpenOffice.org, AbiWord) and performs categorization, summarization, part of speech tagging, document clustering, and indexing/search using Lucene.

You can get it here. It is released under the GPL, with alternative licenses available if the GPL does not work for your project.

Share
Comments
Top
New

No posts

Ready for more?

© 2023 Mark Watson
Privacy ∙ Terms ∙ Collection notice
Start WritingGet the app
Substack is the home for great writing