Trino is a fast, distributed, open source SQL query engine for big data analytics. Your business and data teams can query structured or unstructured data where it lives, across a wide range of storage systems and environments.
Apache Arrow is an open source, cross-language development platform for in-memory data, built around a columnar format. It makes accessing and sharing data across big data systems fast and efficient.
OpenRefine (formerly Google Refine)
OpenRefine is a powerful, open source tool for working with messy data: cleaning it, transforming it from one format into another, and linking and extending it with web services and external data.
Cassandra is a proven, open source database used when you need scalability and high availability without compromising performance. Cassandra's support for replicating across multiple data centers is best-in-class, providing lower latency for your users and the peace of mind of knowing that there are no single points of failure or network bottlenecks.
Graph databases are a type of NoSQL database created to handle highly connected data, where relational databases become slow and awkward to query. They are purpose-built to store and navigate relationships, modeling data as a network of nodes connected by edges. They offer the performance, flexibility, and agility needed when querying relationships across big data systems.
On-premises, Cloud-based, or Hybrid
We can work within any type of computing, storage, or services architecture.