Top 50 Free Statistical Software
Top 50 free statistical software: a list of free and open source statistical software. Statistical software comprises programs used for the collection, organization, analysis, interpretation and presentation of data. SAS University Edition, GNU Octave, ADaMSoft, BV4.1, PSPP, R, pbdR, Shogun, CSPro, CumFreq, Gretl, GDL, OpenMx, OpenStat, MaxStat Lite version, Sage, DAP, Dashboard of Sustainability, Epi Info, Develve, Salstat, Simfit, SOFA Statistics, First Bayes, StatCVS, Statistical Lab, WinBUGS, ViSta, WinPepi, SCaViS, ADMB, OpenEpi, Ploticus, Orange, NCAR Command Language, Perl Data Language, Scilab, SciPy, Yorick, IDAMS, EasyReg, IVEware, Zelig, Statcato, MacAnova, Dataplot, and Arc are some of the top free statistical analysis software packages, in no particular order.
Here is a list of some of the top free and open source statistical software.
Top Free Statistical Software
1. SAS University Edition
SAS University Edition includes the most recent releases of SAS Studio, Base SAS, SAS/STAT, SAS/IML and SAS/ACCESS. Features include an intuitive interface for interacting with the software; a powerful programming language that is easy to learn and use; comprehensive, reliable tools that include state-of-the-art statistical methods; and a robust, flexible matrix programming language for more in-depth, specialized analysis and exploration. SAS University Edition provides easy access to statistical software for research and for courses in fields such as economics, social sciences, computer science, business, medicine, health and engineering. Once downloaded, the software can be used on a standalone PC, Mac or Linux workstation. The package also includes e-learning classes, training videos and access to the SAS Analytics U community, where you can interact with other users, share ideas and access more SAS resources.
Dataiku DSS is a collaborative data science platform that enables teams to explore, prototype, build, and deliver their own data products more efficiently. It provides an interactive visual interface where users can point, click, and build, or use languages like SQL, to wrangle data, build models, easily re-run workflows, visualize results, and get up-to-date insights on demand. It also provides tools to draft data preparation and modeling steps in seconds, lets users leverage their favorite ML libraries (scikit-learn, R, MLlib, H2O, and so on), and supports automating their work in a fully customizable interface.
GNU Octave is a tool for numerical computation. It provides a command-line interface for solving linear and nonlinear problems and for performing other numerical experiments.
PSPP is software for the analysis of sampled data, with both a graphical user interface and a conventional command-line interface. It is an alternative to IBM SPSS Statistics and is written in C. PSPP functionality includes descriptive statistics, t-tests, ANOVA, linear and logistic regression, cluster analysis, reliability and factor analysis, non-parametric tests and more. PSPP can generate high-quality plots to help visualise the distribution of data, including box-and-whisker plots, normal probability plots and histograms.
ADaMSoft is an open source statistical software developed in Java which supports Neural Networks MLP, Graphs, Data Mining, Linear regression, Logistic regression, Statistical classification, Record linkage methods, Decision trees, Cluster analysis, Data Editing and imputation, Principal component analysis and Correspondence analysis.
BV4.1 is a tool for decomposing time series using the Berlin procedure. It decomposes and seasonally adjusts monthly or quarterly economic time series.
R is a free implementation of the S language and a software environment for statistical computing and graphics. R Commander and Rattle GUI are graphical user interfaces for R. The R language is widely used among statisticians and data miners for data analysis. The statistical and graphical techniques supported include linear and nonlinear modeling, classical statistical tests, time-series analysis, classification and clustering. One of R’s strengths is the ease with which well-designed publication-quality plots can be produced, including mathematical symbols and formulae where needed.
pbdR is a series of R packages which are enhanced by SPMD parallelism for big data analysis. The “Programming with Big Data in R” project (pbdR) enables high-level distributed data parallelism in R, so that it can easily utilize large HPC platforms with thousands of cores, making the R language scale to unparalleled heights.
Shogun is a large-scale machine learning toolbox which provides several support vector machine implementations. There are also interfaces to Octave, Matlab, Python and R. Shogun is designed for unified large-scale learning for a broad range of feature types and learning settings, like classification, regression, or exploratory data analysis.
CSPro, the Census and Survey Processing System, is developed by the U.S. Census Bureau and ICF International. CSPro is used for entering, editing, tabulating, mapping, and disseminating census and survey data.
CumFreq is a tool for cumulative frequency analysis of a single variable and for probability distribution fitting.
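As a rough illustration of what cumulative frequency analysis of a single variable involves, the sketch below sorts a sample and assigns each value an empirical cumulative frequency. It is plain Python and not CumFreq's own code: the function name `cumulative_frequency`, the sample data and the plotting position i / (n + 1) are all assumptions chosen for illustration.

```python
def cumulative_frequency(values):
    """Return (value, cumulative frequency) pairs for a sample.

    Uses the plotting position i / (n + 1), one common convention.
    This is an illustrative assumption, not CumFreq's documented method.
    """
    ordered = sorted(values)
    n = len(ordered)
    return [(x, (i + 1) / (n + 1)) for i, x in enumerate(ordered)]


if __name__ == "__main__":
    rainfall = [12.0, 30.5, 7.2, 18.9]  # made-up sample data
    for value, freq in cumulative_frequency(rainfall):
        print(f"{value:6.1f}  {freq:.2f}")
```

The resulting pairs are the kind of points to which a probability distribution would then be fitted.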
Gretl (Gnu Regression, Econometrics and Time-series Library) is used mainly for econometrics and has a graphical user interface. It offers a wide variety of estimators (least squares, maximum likelihood, GMM; single-equation and system methods) and time series methods (ARIMA, GARCH, VARs and VECMs, unit-root and cointegration tests, Kalman filter, etc.).
12. GNU Data Language
GNU Data Language (GDL) is a free alternative to IDL (Interactive Data Language). GDL is developed to serve as a tool for data analysis and visualization. As a language, GDL is dynamically typed and vectorized, and has object-oriented programming capabilities. GDL library routines handle numerical calculations (e.g. FFT), data visualisation, signal/image processing, interaction with the host OS, and data input/output.
OpenMx is an R package for extended structural equation modeling which allows estimation of a wide variety of advanced multivariate statistical models. OpenMx consists of a library of functions and optimizers that allow you to quickly and flexibly define an SEM model and estimate parameters given observed data.
OpenStat contains a large variety of parametric, nonparametric, multivariate, measurement, statistical process control, financial and other procedures. It also lets you simulate a variety of data for tests, theoretical distributions, multivariate data, etc.
15. MaxStat Lite version
MaxStat Lite version is an easy-to-use program that performs statistical analysis in three steps within a single dialog box. MaxStat supports over 100 commonly used statistical tests and makes it easy to interpret results and create high-quality graphs. MaxStat includes descriptive statistics, hypothesis tests, linear and nonlinear regression, correlation, multivariate analysis, and time series analysis.
Sage (System for Algebra and Geometry Experimentation) covers many aspects of mathematics, including algebra, combinatorics, numerical mathematics, number theory, and calculus. It combines the power of many existing open-source packages into a common Python-based interface.
Dap performs data management, analysis, and graphical visualization tasks. It is a command-line-driven program which performs tests on means and percentiles, correlation, ANOVA, categorical analysis, linear and logistic regression analysis and nonparametric statistics.
18. Dashboard of Sustainability
The Dashboard of Sustainability is a software package configured to model the complex relationships among economic, social, and environmental issues. The software is designed to help developing countries achieve the Millennium Development Goals.
Epi Info is statistical software for epidemiology developed by the Centers for Disease Control and Prevention (CDC). The program allows for electronic survey creation, data entry, and analysis.
Develve is free statistical software for analysis which provides a clear overview of your data. With no deeply hidden menus, everything is directly accessible and the results are directly visible. The capabilities of the program include tests for normally distributed data (t-test for difference in means, F-test for variation, one-way ANOVA, sample size calculations), tests for non-normally distributed data (Mann-Whitney test for difference in medians, Wilcoxon test for difference in medians, Levene variation test, normality test), proportions, correlation tests, etc.
Salstat is used for the statistical analysis of numeric data through a graphical user interface and a command-line interface. It can perform a range of tests, from descriptive statistics through to analysis of variance tests and their nonparametric equivalents.
Simfit is a Windows package for simulation, curve fitting, statistics, and plotting, using a library of models or user-defined equations.
23. SOFA Statistics
SOFA Statistics (Statistics Open For All) is a statistics package with a graphical user interface. The main statistical tests available are independent and paired t-tests, Wilcoxon signed ranks, Mann–Whitney U, Pearson’s chi squared, Kruskal-Wallis H, one-way ANOVA, Spearman’s R, and Pearson’s R.
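To make two of the tests named above concrete, here is a minimal sketch of Pearson's R and Spearman's R in plain Python. It does not use or reproduce SOFA Statistics itself; the function names `pearson_r`, `average_ranks` and `spearman_r` are hypothetical, and Spearman's coefficient is computed here as Pearson's R applied to tie-averaged ranks.

```python
import math


def pearson_r(xs, ys):
    """Pearson's correlation coefficient for two equal-length samples."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two samples of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    # Note: raises ZeroDivisionError if either sample is constant.
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """Rank values from 1, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions i..j share this rank
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_r(xs, ys):
    """Spearman's rank correlation: Pearson's r computed on the ranks."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


if __name__ == "__main__":
    hours = [1, 2, 3, 4, 5]        # made-up sample data
    scores = [52, 55, 61, 70, 90]  # made-up sample data
    print(pearson_r(hours, scores))
    print(spearman_r(hours, scores))
```

A package such as SOFA Statistics would also report significance levels alongside these coefficients, which this sketch omits.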
First Bayes is a teaching package for elementary Bayesian Statistics. First Bayes is intended as an aid to learning Bayesian Statistics, in conjunction with a course or suitable reading.
Past is free software for scientific data analysis, with functions for data manipulation, plotting, univariate and multivariate statistics, ecological analysis, time series and spatial analysis, morphometrics and stratigraphy.
MicrOsiris is a comprehensive statistical and data management package for Windows. It includes special techniques for data mining (SEARCH) and analysis of nominal- and ordinal-scaled data (MNA, MCA), and an interface to IVEware, the Michigan Survey Research Center’s software for missing-value imputation, variance estimation and regression under complex sampling designs.
ViSta, the Visual Statistics System, features statistical visualizations that are highly dynamic and very interactive. ViSta constructs very-high-interaction, dynamic graphics that show multiple views of data simultaneously. The graphics are designed to augment visual intuition and help users better understand their data.
StatCVS, written in Java, generates graphical reports about CVS modules. It retrieves information from a CVS repository and generates various tables and charts describing the project development.
29. Statistical Lab
Statistical Lab is an exploratory and interactive toolbox for statistical analysis and visualization of data. The graphical user interface is designed to make complex statistical relations easy to understand. It connects and displays data frames, frequency tables, random numbers or matrices.
WinBUGS is statistical software for Bayesian analysis using Markov chain Monte Carlo (MCMC) methods.
WinPepi is a package of statistical programs for epidemiologists, comprising seven programs with over 120 modules.
SCaViS is a Java-based framework for scientific computation, data analysis and data visualization. The package supports several mathematical, data-analysis and data-mining features, such as 2D and 3D interactive visualization of data, functions, histograms, charts, random numbers and statistical samples, contour plots, scatter plots, neural networks, linear regression and curve fitting using several minimization techniques, cluster analysis and cellular automata.
WinIDAMS is a software package for the validation, manipulation and statistical analysis of data, developed by the UNESCO Secretariat in co-operation with experts from various countries. It is distributed free-of-charge upon request.
ADMB (AD Model Builder) is a C++-based software suite for developing state-of-the-art nonlinear statistical models using automatic differentiation. ADMB is built around the AUTODIF Library.
Ploticus is an application for generating a variety of graphs from raw data.
Orange is component-based data mining, machine learning and bioinformatics software.
38. NCAR Command Language
NCAR Command Language is a free interpreted language designed by the National Center for Atmospheric Research for scientific visualization and data processing.
39. Perl Data Language
Perl Data Language is a set of free software array programming extensions to the Perl programming language. PDL extends the data structures built into Perl, to include large multidimensional arrays, and adds functionality to manipulate those arrays as vector objects.
Scilab is an open source, cross-platform numerical computation package and a high-level, numerically oriented programming language. It can be used for signal processing, statistical analysis, image enhancement, fluid dynamics simulations, numerical optimization, and the modeling and simulation of explicit and implicit dynamical systems.
SciPy is a computing environment and open source ecosystem of software for the Python programming language used by scientists, analysts and engineers doing scientific computing and technical computing.
Yorick is an interpreted programming language designed for numerics, graph plotting and steering large scientific simulation codes. It is quite fast due to array syntax, and extensible via C or Fortran routines.
EasyReg (Easy Regression) conducts various econometric estimation and testing tasks on all 32-bit and 64-bit Windows platforms up to Windows 7.
IVEware (Imputation and Variance Estimation Software) performs single or multiple imputations of missing values using the Sequential Regression Imputation Method; a variety of descriptive and model-based analyses that account for complex design features such as clustering, stratification and weighting; and multiple imputation analyses for both descriptive and model-based survey statistics.
Zelig is a single, easy-to-use program that can estimate, help interpret, and present the results of a large range of statistical methods.
Statcato is a free Java software application developed for elementary statistical computations, including sorting, ranking and standardizing data, generating patterned data, generating random samples, and probability calculations (probability density, cumulative probability, inverse cumulative probability).
MacAnova is a free, open source, interactive statistical analysis program. Its strengths are analysis of variance and related models, matrix algebra, time series analysis (time and frequency domain), and (to a lesser extent) uni- and multi-variate exploratory statistics. Core MacAnova has a functional/command-oriented interface, but an increasing number of capabilities are available through a menu/dialog/mouse type interface.
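The elementary computations listed here can be sketched with Python's standard library. This is only an illustration of the concepts, not Statcato's implementation: the helper name `standardize` and the sample values are assumptions, and the probability calculations use a standard normal distribution as an example.

```python
from statistics import NormalDist, mean, stdev


def standardize(values):
    """Convert values to z-scores using the sample standard deviation."""
    m = mean(values)
    s = stdev(values)  # sample (n - 1) standard deviation
    return [(x - m) / s for x in values]


if __name__ == "__main__":
    print(standardize([2.0, 4.0, 6.0]))  # made-up sample data

    z = NormalDist()  # standard normal: mean 0, standard deviation 1
    print(z.pdf(0.0))        # probability density at 0
    print(z.cdf(1.96))       # cumulative probability up to 1.96
    print(z.inv_cdf(0.975))  # inverse cumulative probability
```

`NormalDist` is available from Python 3.8 onward; other distributions would need a different tool.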
Dataplot is a free, multi-platform software system for scientific visualization, statistical analysis, and non-linear modeling. The target Dataplot user is the researcher and analyst engaged in the characterization, modeling, visualization, analysis, monitoring, and optimization of scientific and engineering processes.
Arc is a free statistical analysis tool for regression problems.