Skip to main content

Main navigation


 
Avro
It is a data serialization system on JSON defined schemas with APIs present on C, C++, C# and Java. It is OS Independent.
Go
BIRT
Co-founded by Actuate, adds reporting functionalities to Java applications. Is OS Independent.
Go
Blazegraph
It is a highly scalable and high-performance database which is available as open-source and with commercial license. It is OS Independent.
Go
Cassandra
Developed by Facebook, the NoSQL database is nowadays handled by Apache Foundation. It’s used by Netflix, Urban Airship, Twitter, Reddit, Constant Contact, Digg and Cisco. It is OS Independent.
Go
Chukwa
Built on platforms MapReduce and HDFS, it gathers data from larger distributed systems with displaying and analyzing the gathered data. Works on Linux and OS X.
Go
CouchDB
It stores web data in JSON documents accessed through the query using JavaScript. Also offers distributed scaling and fault-tolerant storage. Works on Windows, Android, Linux, OS X.
Go
DataMelt
Can do data mining, statistical analysis, mathematical computation and data visualization. It supports Java and related programming languages including Jython, Groovy, JRuby and Beanshell. It is OS Independent.
Go
ECL
ECL is a full set of tools, comprising of an IDE and debugger in HPCC, with documentation available on HPCC website. It operates on Linux.
Go
FlockDB
Store Twitter social graphs (i.e., who is following or blocking whom) with horizontal scaling and swift reads and writes. Is OS Independent.
Go
Flume
An Apache project, it gathers, aggregates and transfers the required log data from apps to HDFS. It’s robust, fault-tolerant Java-based project. Operates on Windows, Linux and OS X.
Go
Gluster
It provides unified file and objects storage for larger data-sets. Can be scaled to 72 brontobytes, extending Hadoop capabilities on Linux.
Go
GridGain
Offers in-memory processing for quick analysis of the real-time data. Works on windows, Linux, OS X Operating Systems.
Go
Hadoop
Frequently the terms “Hadoop” and “big data” are utilized synonymously. The Apache Foundation sponsors multiple projects that range the competences of Hadoop. Multiple vendors provide supported versions of Hadoop and connected technologies. Works on Windows, Linux and OS X.
Go
Hadoop Distributed File System
It is a primary storage structure for Hadoop. It rapidly replicates data onto numerous nodes in a cluster in order to deliver reliable, speedy performance. Works on Windows, Linux and OS X.
Go
Hbase
HBase is an Apache project, with a non-relational data store for the Hadoop. Functionalities comprise of linear and modular scalability, automatic failover support and more. Is OS independent.
Go
Hibari
It is important big data storage with consistency, availability and quick performance supporting many telecom companies. Is OS Independent.
Go
Hive
It is Hadoop’s data warehouse, offers data summarization and analysis of big data. It uses a SQL-like language, HiveQL and is OS Independent.
Go
HPCC Systems
It is a high performance computing cluster offering better performance to Hadoop. It works on Linux with free community versions and paid ones
Go
Hypertable
Provides effectiveness and quick performance resulting in cost savings. It has both open source and but paid support. Available on Linux, OS X.
Go
Infinispan
Java-based, highly scalable data grid platform used for multi-core architecture and offers distributed cache competences. Is OS Independent.
Go
InfoBright Community Edition
It is a scalable data warehouse with storage up to 50TB and compression up to 40:1 for best-driven performance. Works on Windows, Linux.
Go
Jaspersoft
It is the most used, flexible, cost-effective and deployed BI software across the globe. Has both commercial and open-source versions, includes Big Data reporting solutions and is OS Independent.
Go
Jedox
Includes Palo Web, OLAP Server, Palo for Excel and Palo ETL Server with open source and commercial software-based tools. Is OS Independent.
Go
KEEL
KEEL assists use evaluates algorithms for data mining issues like classification, regression, pattern mining and clustering. It comprises of a big collection of prevailing algorithms that it uses to associate new algorithms. It is OS Independent.
Go
KNIME
Provides user-friendly data processing, integration and analysis. Gartner named KNIME as a “Cool Vendor” in 2010 for analytics, BI and performance. Operates on Windows, Linux and OS X.
Go
Lucene
It offers very quick indexing and searching capabilities for huge datasets. It indexes over 95GB/hour while utilizing modern hardware. It is OS Independent.
Go
Mahout
Offers algorithms for classification, clustering, and collaborative filtering on Hadoop. The project’s objective is to shape scalable machine learning libraries. Is OS Independent.
Go
MapReduce
It is a programming model and framework for creating applications that speedily analyse big data, parallel on the clusters to compute nodes. Utilized by Hadoop and other processing applications with the independent OS.
Go
Neo4j
The global graph database improves performance to 1000x or more vs. the relational databases. It even has advanced versions and works on Windows, Linux.
Go
Oozie
It is an Apache project which is built to coordinate with the scheduling of Hadoop jobs. It triggers jobs at a programmed time or as per data availability. Works on Linux and OS X.
Go
Orange
Provides multiple visualizations and a toolbox of 100+ widgets. Works on Windows, Linux and OS X.
Go
OrientDB
Stores 150,000 documents per second with loading graphs in just a few milliseconds. Supports ACID transactions and the fast indexes.
Go
Pentaho
Provided big data analytics tools to 10,000 companies along with data mining, dashboard and reporting. Operates on Windows, Linux and OS X.
Go
Pig
It is an Apache data analysis tool that uses a textual language known as Pig Latin, producing sequences of programs for Map-Reduce. It assists writing, understanding and maintaining programs with data analysis tasks performed parallelly. It is OS Independent.
Go
R
Build by Bell Laboratories, R is a programming language with an environment for graphics and statistical computing similar to S. The environment comprises of tools that make it simpler to operate data, create graphs, charts and do calculations with Windows, Linux and OS X.
Go
RapidMiner
RapidMiner brings artificial intelligence to the enterprise through an open and extensible data science platform.
Go
Rattle
Makes it simpler for non-programmers to utilize R language by offering a graphical interface for mining of data. Can build models, score datasets and draw graphs. Works on Windows, Linux and OS X.
Go
Redis
Offers in-memory key-value store saved on disk for availing persistence. Supports many programming languages and operates on Linux.
Go
Riak
A powerful open-source and distributed database. Users comprise of Comcast, Voxer, Yammer, Joyent, Boeing, Kiip.me, SEOMoz, Formspring, DotCloud and Danish Government. Works on Linux and OS X.
Go
Solr
It is an advanced enterprise search tool based on Lucene. It empowers search capabilities for larger websites, which includes Netflix, CNET, AOL and Zappos. It is OS Independent.
Go
SpagoBI
Is complete open source business intelligence solution with commercial services, support and training and is OS Independent.
Go
SPMF
It is java based data mining framework, with focus on sequential pattern mining, and has tools for linking rule mining, item set mining and sequential rule mining. It has 46 diverse algorithms and is OS Independent.
Go
Sqoop
It transfers data between RDBMSes, Hadoop and data warehouses. It is a topmost level Apache project now and is OS Independent.
Go
Storm
Owned by Twitter, it provides distributed real-time computation competencies and is called as “Hadoop of real-time.” It’s scalable, fault-tolerant, robust, works with all programming languages, with Linux OS.
Go
Terracotta
It’s “Big Memory” platform that allows enterprise applications to manage and store big data in the server memory, with speedy performance. The company provides open-source and commercial versions of its platform. It is OS Independent.
Go
Terrastore
It offers scalability, elasticity and consistency. Supports range queries, custom data partitioning, push-down predicates, server-side update functions, event processing and reduce querying. Is OS independent.
Go
Weka
Offers data mining algorithms that can be applied to data or use in other Java applications. It’s a fragment of a big machine learning project, sponsored by Pentaho. Operating System: Windows, Linux, OS X.
Go
Zookeeper
It is a centralized service for keeping up configuration details, naming, offering distributed synchronization with group services. APIs are obtainable for Java and C, Python, REST and Perl. Works on Linux, Windows (only development) and OS X (only development).
Go
 A   B   C   D   E   F   G   H   I   J   K   L   M   N   O   P   R   S   T   W   Z 
1010data
It comprises of progressive, built-in analytic functions like distribution analysis, variance, correlation, forecasting and predictive modelling along with machine learning. All these functions are integrated straight into the system, to rum them swiftly on big data volumes.
Go
Actian
It delivers advanced analytics in 3 editions, Extreme Performance, Hadoop SQL and Cloud Edition. These editions help in creating analytics value chain, deliver actionable business value, offer high-level data enrichment, SQL analytics, visual design on Hadoop without MapReduce skills. Provide robust data quality and on-premises applications with cloud edition.
Go
Alteryx
The solution access, integrates, and cleans data sources as Hadoop or NoSQL or Teradata with multiple predictive and spatial tools, in a very simplified workflow environment.
Go
Amazon Web Service
It offers cloud-based analytics to assist you to analyze and further process required data volume, needed for Hadoop clusters, petabyte-scale data warehousing, real-time streaming data and for the orchestration.
Go
Amdocs
Amdocs Insight Big Data platform backs Amdocs analytical apps and data services to enable revenues, drive business competence and improve customer experience.
Go
Cisco
Cisco offers integrated infrastructures as well as analytics to support big data ecosystem, providing a scalable and secure infrastructure with valuable insights.
Go
Cisco
It includes computing, connectivity, storage and the unified management abilities. This architecture is transparent, delivers simplified data and manages integration with enterprise ecosystems.
Go
Cloudera Enterprise
The platform comprises of CDH, the open-source Hadoop with data management and system management solution tools with community advocacy and dedicated support.
Go
CSC
It assists enterprises to get value from their data much more swiftly. Using this tool an enterprise can quickly develop, secure and deploy big data and analytics applications with a central subscription platform that utilizes analytics software, tools and infrastructure.
Go
Datameer
Datameer is a SaaS big data analytics platform used for department-specific deployment. It features Hadoop cloud providers Bigstep and Altiscale. It eases big data analytics environment into a single app on top of Hadoop platform.
Go
DataStax
It enables enterprise-level and integrated data analytics with search, visual management, and the expert support. It is one of the best-distributed database choices for online apps that need swift performance without downtime.
Go
Dell
This solution includes Boomi AtomSphere, Kitenga Analytics Suite, and the SharePlex Connector for Hadoop. The Kitenga Analytics suite offers you with d visualization capabilities and the integrated information modelling in business analytics and big data search platform.
Go
FICO
It provides Big Data Analytics solutions, Predictive Analytics and Business Intelligence software solutions which comprise of Orchestrator, Decision management tool, Decision optimizer, Model builder, Model central, and the complete solution stack.
Go
Flytxt
It is a primary storage structure for Hadoop. It rapidly replicates data onto numerous nodes in a cluster in order to deliver reliable, speedy performance. Works on Windows, Linux and OS X.
Go
Fusion-io
These solutions remove the workload performance deficiencies for Cassandra, MongoDB and the NoSQL databases, like HBASE, while reducing their overheads architectures. Fusion-based solutions provide consistent and predictable performance through the entire database, with an effective system that needs less DRAM, lesser nodes, and utilizes lesser energy.
Go
GE
It co-ordinates with industrial apps for working effectively to optimize complete operational environments.
Go
Gooddata
It is the suit of tools, frameworks and APIs, for BI solutions to collaborate, analyze and visualize data which is built-in the cloud and delivered as a service.
Go
Google
The platform analyses data at the scale of the complete web, with SQL and in an entirely managed, serverless architecture where backend system infrastructure is managed automatically, and you can focus on business insights part.
Go
Google BigQuery
It is a very useful web service which enables companies to explore and analyze giant datasets by utilizing Google’s infrastructure. It can easily analyze billions of rows in just seconds. It is highly scalable with SQL query language. BigQuery helps developers and businesses use data analytics against multi-terabyte datasets in few seconds.
Go
Guavus
It is capable of generating actionable information from broadly distributed and large volume data sets in near real-time. Uses machine learning and computational algorithms to filter actionable data insights.
Go
Hortonworks
HDP platform is used for multi-workload data processing through an array of methods for processing from the batch by interactive to real-time; supported with governance, security, integration and the required operations.
Go
HP
HP’s Big Data Analytics solution comprises of HP Vertica and HP HAVEn. HP HAVEn is a tool which includes software, hardware and services. Big Databe it structured or unstructured can be analysed to drive powerful strategic insights. HP Vertica Dragline let companies store their data in a cost-efficient manner and offers competences to explore it swiftly utilizing SQL based tools.
Go
HPCC Systems
This system is an open-source solution platform for Big Data analysis. It has a data Refinery engine known as Thor, that cleans, links transforms and analyses the Big Data. The Thor tool supports ETL (Extraction, Transformation and Loading) utilities to sort and analyze unstructured as well as structured data, data linking, profiling and hygiene. The Roxie which is an advanced data delivery engine offers both high concurrent as well as low latency real-time query abilities.
Go
IBM Solution
IBM Big data analytics solution portfolio includes InfoSphere BigInsights, InfoSphere Streams, IBM PureData, IBM Watson Explorer, DB2 with BLU Acceleration, InfoSphere Information Server, IBM Smart Analytics System and the InfoSphere Master Data Management.
Go
Informatica
It offers an effective, safe path to integrate data on Hadoop at all the scales without learning Hadoop.
Go
Intel
Intel portfolio comprises of technology products like 10 Gigabit server adapters, Intel Xeon processors, SSDs with Intel distribution to improve overall performance levels for big data solution projects.
Go
MapR
The MapR Distribution for Apache Hadoop offers companies with the highest grade distributed data platform to store and practice big data. MapR packages enablesinteractive, batch and real-time applications.
Go
MarkLogic
It brings all features in a unified system with a document-centric, structure-aware, schema-agnostic, transactional, clustered, secure, database server with search and application services.
Go
Microsoft
Microsoft Azure is a flexible and an open cloud tool that enables to swiftly build, deploy and handle applications transversely across a global network of Microsoft managed data centres. The applications can be created using any language, framework or platform and further integrated with public cloud apps in the required IT environment.
Go
MicroStrategy
Also called PRIME, being deployed on Cloud, offers visualization and dashboard engine with parallel in-memory data storage. The architecture enables to create and deploy powerful applications that deliver analytics to multiple users in a quick time and the cost.
Go
MongoDB
It is the finest NoSQL database, that empowers businesses with more agility and scalability. It is used to create new categories of apps, accelerate time to market, decrease costs and improves customer experience.
Go
Mu Sigma
It is a platform for Data Sciences which includes muHPC, music, and the muText. muXo is an engine for decision optimization which solves highly complex business problems. It offers continuously evolving and competitive meta-heuristic algorithms. On the other hand, muHPC is a popular suite of all the statistical algorithms, being integrated as R packages, used for Big Data analysis.
Go
Opera Solutions
Its vector Big data analytics and the signal-processing platform adds Big Data flows from all sources of enterprises; offering the technology to extract and to store signals and supports signal application deployment.
Go
Oracle
Oracle Big Data Analytics solutions comprise of Oracle Big Data Appliance, Oracle Exalytics In-Memory Machine and Oracle Exadata Database Machine. These are engineered systems that are pre-integrated to decrease the complexity and cost of the IT infrastructure. The database also includes Oracle Database, MySQL, Oracle NoSQL Database, MySQL Cluster, Oracle NoSQL Database, Oracle Event Processing, Oracle Coherence, Oracle Endeca Information Discovery and database analytics.
Go
Palantir
The solution comprises of Palantir Gotham to manage, integrate, analyze and secure enterprise data and Palantir Metropolis to enrich, integrate, model, and analyze any type of quantitative data.
Go
Pentaho
It provides an all-inclusive and unified solution that is used by big data lifecycle. No matter what is the data source, this solution offers visual big data analytics tools to extract data, get visualizations and analytics. It is highly scalable and uses the open standard-based architecture to integrate or extend present infrastructure.
Go
Pivotal
The solution assists to discover insights from data to create applications that can be used to store, deliver and manage value from large data sets utilizingdisruptive set of enterprise data products to serve customers. The products include MPP and column-store databases, Hadoop and in-memory data processing.
Go
Platfora
It is built on Spark, Hadoop and the native cloud APIs. It fits in anywhere including existing analytics ecosystems, BI tools and hardwares.
Go
QlikView
It offers 2 approaches to manage Big Data, both with finest user experience. QlikView offers both 100% In-Memory Architecture and a hybrid approach that works on both in-memory data and data from external sources.
Go
Redhat
Red Hat Enterprise Linux is a primary platform for big data deployments. It has features that meet advanced big data needs.
Go
SAP
SAP Big Data Analytics tool comprises of In-Memory Platform known as SAP HANA & SAP IQ, that is a column-oriented and grid-based parallel processing database. There is even SAP HANA tool and Apache Hadoop solution that are together. Big Data Analytics solutions comprise of Text Analytics solutions and Predictive Analytics.
Go
SAS
SAS Big Data Analytics portfolio comprises of credit scoring for SAS High-Performance Data Mining, SAS Enterprise Miner, SAS Scoring Accelerator, SAS Text Miner, SAS Model Manager and the SAS Visual Statistics.
Go
SGI
It provides HadoopSolutions with all the cluster installations with multiple nodes. SGI UV compromises of shared memory platform to search hidden data relationships with real-time analysis.
Go
Splunk
The analytics solution provides a complete portfolio of Big Data software like Splunk analytics for NoSQL Data Stores, Hadoop, Splunk Hadoop Connect, Splunk DB Connect and Hadoop Management.
Go
Syncsort
It assists Hadoop for collecting, processing and integrating complex data. It removes challenges for extensive Hadoop adoption by connecting, developing, deploying and accelerating it without any programming.
Go
Tableau
This solution connects to data, at any time and from anywhere, irrespective of its complexity and size or combination of structured and unstructured data with tools like Google BigQuery and Hadoop flavours.
Go
Talend Open Studio
Talend Open Studio is a multipurpose set of open source products for deploying, developing, testing, and administrating data management & application integration project tasks. Talend offers a unified platform that makes app integration and data management simpler. It further enables a unified environment for handling the complete lifecycle through enterprise boundaries.
Go
Teradata
The tool has built an architecture known as the Unified Data Architecture in Big Data Analytics. The Teradata Aster Discovery solution platform simplifies the analysis of critical business data insights from all the data categories. With its strong analytic applications joined with marginal time and work requirements, Teradata offers the insights required for different companies.
Go
Vmware
It is a highly robust platform with a high-performance virtualization layer that is used with server hardware resources, making them shareable by several virtual machines. Swiftly runs Hadoop workload for achieving better utilization, agility and reliability.
Go
 0   A   C   D   F   G   H   I   M   O   P   Q   R   S   T   V 

CC BY NC

Back to top