Hellmar Becker's Blog
About Archives Blogroll Publications GitHub
  • Druid 28 Sneak Peek: Ingesting Multiple Kafka Topics into One Datasource

    Oct 29, 2023 • blog, apache, druid, imply, streaming, kafka, tutorial

    Pizza

    Read on →

  • New in Imply Polaris: Data Retention Policy

    Sep 24, 2023 • blog, imply, polaris, druid, data_lifecycle

    Apache Druid has always had built-in data lifecycle management by way of retention rules. Specifying fixed time intervals or relative periods, you would tell Druid to retain only data segments that are not older than x days.

    Read on →

  • New in Apache Druid 27: Querying Deep Storage

    Sep 7, 2023 • blog, druid, imply, query, storage

    In realtime analytics, a common scenario is that you want to retain a lot of (years of) historical data in order to run analytics over a longer period of time. But these analytical queries occur infrequently and their performance is usually not critical. The bulk of everyday queries, however, accesses only a limited set of relatively fresh data, typically 1 or 2 weeks worth.

    Read on →

  • Using Druid with MinIO

    Aug 29, 2023 • druid, minio, tutorial, blog

    With on premise setups, compute/storage separation is often implemented using a NAS or similar storage unit that exposes an S3 API endpoint.

    Read on →

  • Druid Sneak Peek: Graphical Data Exploration

    Jul 30, 2023 • blog, apache, druid, imply, visualization, tutorial

    Screenshot of time chart

    Read on →

« Older Newer »

© - Powered by Jekyll & whiteglass - Subscribe via RSS