Anaconda is an open data science platform powered by Python. The open source version of Anaconda is a high performance distribution of Python and R and includes over 100 of the most popular Python, R and Scala packages for data science. There is also access to over 720 packages that can easily be installed with conda, the package, dependency and environment manager, that is included in Anaconda.Includes the most popular Python, R & Scala packages for stats, data mining, machine learning, deep learning, simulation & optimization, geospatial, text & NLP, graph & network, image analysis. Featured packages include: NumPy, SciPy, pandas, [...]
R is the world’s most powerful, and preferred, programming language for statistical computing, machine learning, and graphics, and is supported by a thriving global community of users, developers, and contributors.The Microsoft R product family includes: Microsoft R Server, Microsoft R Client, Microsoft R Open, SQL Server R Services.Microsoft R Server is the most broadly deployable enterprise-class analytics platform for R . Supporting a variety of big data statistics, predictive modeling and machine learning capabilities, R Server supports the full range of analytics exploration, analysis, visualization and modeling based on open source R. Microsoft R [...]
Scikit-learn is an open source machine learning library for the Python programming language.It features various classification, regression and clustering algorithms including support vector machines, random forests, gradient boosting, k-means and DBSCAN, and is designed to interoperate with the Python numerical and scientific libraries NumPy and SciPy.
Classification : Identifying to which category an object belongs to Applications: Spam detection, Image recognition. Algorithms: SVM, nearest neighbors, random forest. Regression : Predicting a continuous-valued attribute associated with an object. Applications: Drug [...]
Dataiku DSS is the collaborative data science platform that enables teams to explore, prototype, build, and deliver their own data products more efficiently. Dataiku DSS provides an interactive visual interface where they can point, click, and build or use languages like SQL to data wrangle, model, easily re-run workflows, visualize results, and get up-to-date insights on demand.
Dataiku DSS provides tools to draft data preparation and modelisation in seconds, that wish to leverage their favorite ML libraries (scikitlearn, R, MLlib, H2O, and so on), and that rely on automating their work in a completely customizable [...]
Actian Vector Express is a free community version of the Actian Analytics Platform designed to provide a fast and simple way to improve the performance of your analytics. It is built on top of our record breaking vector based analytics database, Actian Express delivers unmatched performance and price/performance and requires less hardware and virtually no tuning.
Actian Vector Express includes the following capabilities: Analytics Workbench – quickly build visual workflows to prepare, transform, and analyze data, Analytics Database – run complex queries against billions of records in seconds and Management [...]
Actian Express- Hadoop SQL Edition delivers Big Data Value : Actian Analytics Platform, Express Hadoop SQL Edition, is a free community version of the end-to-end analytics platform running 100 percent inside of Hadoop. With no limits on the number of Hadoop nodes, and data up to 500GB, Actian Express, Hadoop SQL Edition supercharges Hadoop adoption and accelerates time-to-value for organizations that have been struggling to get value from their Hadoop investments. Hadoop has proven to be an unbeatable data reservoir given its scalability and cost effectiveness. But Hadoop on its own can present adoption and performance challenges for organizations that [...]
GraphLab Create : GraphLab Create is a machine learning platform to build intelligent, predictive application involving cleaning the data, developing features, training a model, and creating and maintaining a predictive service. These intelligent applications provide predictions for use cases including recommenders, sentiment analysis, fraud detection, churn prediction and ad targeting. Trained models can be deployed on Amazon Elastic Compute Cloud (EC2) and monitored through Amazon CloudWatch. They can be queried in real-time via a RESTful API and the entire deployment pipeline is seen through a visual dashboard. The time from prototyping to production [...]
HP Haven Predictive Analytics : HP Haven Predictive Analytics is powered by HP Vertica and Distributed R. Distributed R is a high performance analytical engine based on the open source R language developed with HP Labs to address the most demanding, Big Data predictive analytics tasks. Distributed R improves performance and enables users to analyze much larger data sets than was previously possible with the popular R statistical programing language.
Haven Predictive Analytics provides data acceleration and native SQL [...]