The ELKI framework is written in Java and built around a modular architecture. Most of the algorithms currently included belong to clustering, outlier detection and database indexes. A key design goal of ELKI is to allow arbitrary algorithms, data types, distance functions and indexes to be combined, and to evaluate these combinations. When developing new algorithms or index structures, the existing components can be reused and combined.
ELKI is modeled around a database core, which uses a vertical data layout that stores data in column groups (similar to column families in NoSQL databases). This database core provides nearest [...]
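The idea of combining an algorithm with an interchangeable distance function can be sketched in a few lines. This is a minimal illustration in Python, not ELKI's Java API: the names `euclidean`, `manhattan` and `knn_outlier_scores` are hypothetical, and the distance to the k-th nearest neighbour is used here only as one simple example of a distance-based outlier score.

```python
import math
from typing import Callable, List, Sequence

Vector = Sequence[float]
Distance = Callable[[Vector, Vector], float]


def euclidean(a: Vector, b: Vector) -> float:
    """Straight-line (L2) distance between two vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a: Vector, b: Vector) -> float:
    """City-block (L1) distance between two vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def knn_outlier_scores(data: List[Vector], k: int, distance: Distance) -> List[float]:
    """Score each point by the distance to its k-th nearest neighbour.

    The algorithm never inspects the vectors itself; it only calls
    `distance`, so any distance function can be plugged in.
    """
    if not 1 <= k <= len(data) - 1:
        raise ValueError("k must be between 1 and len(data) - 1")
    scores = []
    for i, p in enumerate(data):
        # Distances from p to every other point, smallest first.
        dists = sorted(distance(p, q) for j, q in enumerate(data) if j != i)
        scores.append(dists[k - 1])
    return scores


# Hypothetical toy data: four points on a unit square and one far away.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0), (5.0, 5.0)]

for name, dist in (("euclidean", euclidean), ("manhattan", manhattan)):
    scores = knn_outlier_scores(points, k=2, distance=dist)
    print(name, [round(s, 2) for s in scores])
```

The same scoring function runs unchanged with either distance, which is the kind of reuse the modular design aims for. A brute-force scan is used here for brevity; an index structure would replace the inner loop in a real system.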
What is Data Mining? Data mining is the computational process of discovering patterns, trends and behaviors in large data sets using methods from artificial intelligence, machine learning, statistics, and database systems. The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use. The term is often used as a synonym for knowledge discovery in databases (KDD). Data mining is an essential step in the process of predictive analytics. The process of mining and extraction of useful information from the existing data is an interdisciplinary [...]
What is Data Analysis? Data analysis is the process of inspecting, cleaning, transforming, and modeling data with the objective of discovering useful information, arriving at conclusions, and supporting decision making. There are many facets and approaches, with diverse techniques. In statistics, data analysis is generally divided into descriptive statistics, exploratory data analysis (EDA), and confirmatory data analysis (CDA). Before analysis, data need to be cleaned. Data cleaning is the process of correcting outliers and other incorrect or unwanted information. There are several types of data cleaning process to [...]
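As a small illustration of cleaning followed by descriptive statistics, the sketch below drops outliers with the common 1.5 × IQR rule and then summarizes what remains. The rule, the function names and the sample readings are assumptions chosen for the example, and the sketch removes outliers rather than correcting them; it is one of many possible cleaning strategies, not a recommended default.

```python
import statistics
from typing import List, Tuple


def iqr_bounds(values: List[float], factor: float = 1.5) -> Tuple[float, float]:
    """Return (low, high) fences based on the interquartile range."""
    q1, _median, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    return q1 - factor * iqr, q3 + factor * iqr


def drop_outliers(values: List[float]) -> List[float]:
    """Keep only the values that lie inside the IQR fences."""
    low, high = iqr_bounds(values)
    return [v for v in values if low <= v <= high]


# Hypothetical sensor readings with one obviously wrong entry.
raw = [12.1, 11.8, 12.4, 12.0, 11.9, 98.0, 12.2]
cleaned = drop_outliers(raw)

# Descriptive statistics on the cleaned data.
summary = {
    "n": len(cleaned),
    "mean": statistics.mean(cleaned),
    "stdev": statistics.stdev(cleaned),
}
print(cleaned)
print(summary)
```

Computing the same summary on `raw` instead of `cleaned` shows how strongly a single bad value can distort the mean and standard deviation.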
Neural Designer is a desktop application for data mining that uses neural networks, a main paradigm of machine learning. The software is developed by Intelnics, a startup company based in Spain and founded by Roberto Lopez and Ismael Santana. Neural networks are computational models inspired by central nervous systems, in particular the brain, that can be trained to perform certain tasks. They are capable of machine learning as well as pattern recognition, and are generally presented as systems of interconnected neurons that compute outputs from inputs. Neural [...]
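How interconnected neurons compute outputs from inputs can be shown with a tiny feed-forward pass. This is a generic sketch and not Neural Designer's implementation: the sigmoid activation, the layer layout and the weights are arbitrary assumptions, and the network is untrained.

```python
import math
from typing import List, Sequence, Tuple

# A neuron is described by its input weights and a bias term.
Neuron = Tuple[Sequence[float], float]
Layer = List[Neuron]


def sigmoid(x: float) -> float:
    """Squash any real number into the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def neuron_output(inputs: Sequence[float], weights: Sequence[float], bias: float) -> float:
    """Weighted sum of the inputs plus bias, passed through the activation."""
    return sigmoid(sum(w * x for w, x in zip(weights, inputs)) + bias)


def forward(inputs: Sequence[float], layers: List[Layer]) -> List[float]:
    """Propagate the inputs through each layer in turn."""
    activations = list(inputs)
    for layer in layers:
        # Every neuron in a layer sees all outputs of the previous layer.
        activations = [neuron_output(activations, w, b) for w, b in layer]
    return activations


# Hypothetical 2-2-1 network with hand-picked, untrained weights.
network: List[Layer] = [
    [([0.5, -0.4], 0.1), ([-0.3, 0.8], 0.0)],  # hidden layer: two neurons
    [([1.0, -1.0], 0.0)],                      # output layer: one neuron
]

output = forward([1.0, 0.0], network)
print(output)
```

Training would adjust the weights and biases so that the outputs match known targets; that step is omitted here.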
Weka Data Mining: Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka's features include machine learning, data mining, preprocessing, classification, regression, clustering, association rules, attribute selection, experiments, workflow and visualization. Weka is written in Java and developed at the University of Waikato, New Zealand. All of Weka’s techniques are predicated on the assumption that the data is available as a single flat file or relation, where each data point is described by a fixed number of attributes. Weka provides [...]