A repository of databases, domain theories, and data generators, hosted at the University of California, Irvine, and used by the machine learning community for the empirical analysis of machine learning algorithms. [Free]
Open-source version of the Cyc technology, the world's largest and most complete general knowledge base and commonsense reasoning engine. Can be used as the basis of a wide variety of intelligent applications, including: rapid development of an ontolog...
The RapidMiner toolset is an environment for machine learning through the use of nested operators. Multiple experiments can be nested arbitrarily through an XML-based graphical user interface. (Formerly YALE) [GPL]
A suite of Java libraries for the linguistic analysis of human language which can link entity mentions to database entries, uncover relations, cluster documents, and discover significant trends. [GPL]
Supports several inference and learning algorithms. Allows simulation of static and dynamic networks, including HMMs, IOHMMs, and Kalman filters. [GPL]
This tool implements hidden Markov models and applies them to part-of-speech tagging. Also available: multivariate hypothesis-testing software for Gaussian data, and a ground-truth/metadata editing and visualization toolkit for OCR. [GPL]
An algorithm engine that calculates properties ranging from symmetry, torsion angles, and polar fraction to protein analysis and bond angles. Online version only. [Free]
This package forms a complete gradient descent machine learning library. Modules include support vector machines for classification and regression, ensemble models such as bagging or AdaBoost, non-parametric models such as K-nearest neighbors, Parzen regressio...
Algorithms for searching and matching strings and more complicated patterns such as trees, regular expressions, graphs, point sets, and arrays. [GPL]
A compiler with built-in machine learning that searches for the most efficient compilation for the processor the program runs on. A collective optimization database, predictor web service, and frameworks are available in order to make ...