A java library for the processing and analysis of natural language texts and other sequential data. The focus is on unsupervised modeling with simple, statistical methods as well as implementations of more complex algorithms.
This is attempt to make an open source bookmarking system that supports tagging, distributed data storage, genetic "splicing" of strains of bookmarking tags and much more!
Facilitates data mining/natural language processing experiments to be executed on weblogs, such as classification, clustering and rating. As part of these experiments, it is possible to apply Latent Semantic Analysis.
This project will provide translation of mathematical content, from TeX to MathML and vice-versa, and to graphics formats, as a web service. TeX, running as a daemon, is used for mathematical typography.
Yabman is a tool for managing bibliographic references. Its key features are a quality user interface, a carefully designed data model, and sophisticated three-state hierarchical reference labeling. It is currently usable but in a pre-alpha stage.