About 1,300,000 results
Open links in new tab
  1. GitHub - oxnr/awesome-bigdata: A curated list of awesome big …

    Embulk - open-source bulk data loader that helps data transfer between various databases, storages, file formats, and cloud services. Estuary - SaaS platform based on Gazette with plug …

  2. Awesome Open-Source Data Engineering - GitHub

    Awesome Open-Source Data Engineering This Awesome List aims at providing an overview of open-source projects related to data engineering. This is a community effort: please contribute …

  3. big-data · GitHub Topics · GitHub

    4 days ago · The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in …

  4. GitHub - trinodb/trino: Official repository of Trino, the distributed ...

    Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io) trino.io java distributed-systems data-science sql database big …

  5. IrBigDta/Awesome-Modern-Open-Source-Data-Engineering

    This curated list brings together powerful open-source tools, frameworks, and resources for data engineering 🛠️ and data science 📈. It started with inspiration from pracdata's awesome-open …

  6. big-data-projects · GitHub Topics · GitHub

    5 days ago · GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.

  7. YTsaurus is a scalable and fault-tolerant open-source big data

    YTsaurus is a distributed storage and processing platform for big data with support for MapReduce model, a distributed file system and a NoSQL key-value database.

  8. High Performance Software-Defined Object Storage for Big Data …

    High Performance Software-Defined Object Storage for Big Data and AI, that supports Amazon S3 and Openstack Swift - open-io/oio-sds

  9. GitHub - sacridini/Awesome-Geospatial: Long list of geospatial …

    Interimage - Open Source GEOBIA software. Matlab - Multi-paradigm numerical computing environment and fourth-generation programming language. Metashape - Agisoft Metashape is …

  10. GitHub - pawl/awesome-etl: A curated list of awesome ETL …

    Talend - "an open source application for data integration job design with a graphical development environment" N8n - "Free and open fair-code licensed node based Workflow Automation Tool.