Search

98zap

9 min read 0 views
98zap

Introduction

98zap is a data integration and analytics platform that provides real‑time monitoring, transformation, and visualization of streaming information across distributed systems. Designed to handle high‑volume event streams, it enables organizations to collect, process, and act upon data from a variety of sources - including application logs, telemetry, IoT devices, and third‑party services - within a unified architecture. By offering a modular, scalable solution, 98zap supports both enterprise deployments and small‑to‑medium‑business use cases, providing an alternative to larger, more complex systems such as Splunk, Kafka, and Elastic Stack.

The platform is open source under the Apache 2.0 license, and a commercial edition is available with additional enterprise features such as advanced security, compliance tooling, and professional support. 98zap has been adopted by organizations in finance, telecommunications, manufacturing, and e‑commerce, where the ability to ingest and analyze data at scale is essential for operational efficiency, fraud detection, and customer experience optimization.

History and Development

Origins

98zap was conceived in 2014 by a team of engineers at DataWave Inc., a startup focused on real‑time analytics. The founders recognized a gap in the market: existing tools either lacked the throughput necessary for large‑scale event processing or were too monolithic to integrate seamlessly with modern cloud environments. The initial prototype, dubbed “Zap,” was a lightweight, event‑driven microservice built on Node.js and the Kafka streaming platform. It demonstrated the feasibility of low‑latency ingestion and simple aggregation in a distributed setting.

Evolution

Between 2015 and 2017, the product underwent significant redesigns. The architecture was re‑engineered to adopt a micro‑service pattern based on Docker containers and Kubernetes for orchestration. The core processing engine transitioned from a custom Node.js stack to a more robust implementation in Rust, focusing on memory safety and concurrency. During this period, 98zap gained traction in the open‑source community, with contributors adding new connectors for databases, message queues, and cloud services.

The name “98zap” emerged during a sprint where developers sought a succinct, memorable brand. The number “98” references the founders’ favorite football club, while “zap” reflects the platform’s ability to quickly “zap” data from source to destination. In 2018, the first public release (v1.0) was announced, accompanied by documentation, tutorials, and a sample deployment on Amazon Web Services. The platform’s modular design allowed users to add or remove components without affecting the core system.

Architecture and Design

Core Components

The 98zap architecture is composed of several key modules that collaborate to provide end‑to‑end data flow:

  • Ingestors – Lightweight agents that capture data from various sources (e.g., log files, HTTP APIs, MQTT brokers) and forward it to the processing layer.
  • Stream Processor – A high‑performance engine written in Rust that applies transformations, enrichment, and aggregation rules to data streams. It exposes a declarative domain‑specific language (DSL) for defining pipelines.
  • Storage Layer – A dual storage system consisting of an in‑memory cache for hot data and a persistent database (Cassandra or PostgreSQL) for long‑term retention.
  • Visualization Portal – A web application built with React and D3.js, providing dashboards, alerts, and ad‑hoc query capabilities.
  • Control Plane – A set of RESTful APIs and a command‑line interface that allow administrators to configure, monitor, and scale the system.

Each component is designed to be stateless where possible, enabling horizontal scaling and seamless integration with container orchestration platforms. The architecture emphasizes fault tolerance: failed ingestors automatically retry, and the stream processor implements at‑least‑once delivery semantics.

Technical Foundations

98zap relies on several open‑source technologies to achieve its performance goals:

  1. Rust Programming Language – The core processing engine is implemented in Rust, chosen for its zero‑cost abstractions and safe concurrency primitives. This choice reduces runtime overhead and mitigates common memory‑related bugs.
  2. Apache Kafka – The system uses Kafka as a durable message bus between ingestors and the stream processor. Kafka’s log compaction and retention policies provide reliable data persistence during transient failures.
  3. Protocol Buffers – Data is serialized using Protocol Buffers for compactness and schema enforcement, enabling efficient network transport and storage.
  4. Prometheus & Grafana – These monitoring tools collect metrics from 98zap components, allowing operators to observe latency, throughput, and error rates.

Key Features

98zap offers a rich set of capabilities tailored to modern data processing needs:

  • Real‑Time Processing – Data is processed within milliseconds of ingestion, supporting use cases such as fraud detection and anomaly monitoring.
  • Declarative Pipeline DSL – Users define ingestion, transformation, and routing rules using a concise, human‑readable syntax, reducing the learning curve for data engineers.
  • Dynamic Scaling – Through Kubernetes integration, the platform automatically scales processor instances based on CPU utilization or message backlog.
  • Multi‑Tenant Isolation – Each tenant’s data is logically isolated using namespace partitioning and access control lists.
  • Plug‑In Architecture – Connectors for popular data sources (Kafka, MQTT, REST, WebSocket) and sinks (Elasticsearch, ClickHouse, Snowflake) can be added as separate modules.
  • Security Suite – Supports TLS encryption for all network traffic, role‑based access control, and integration with LDAP/Active Directory for authentication.
  • Compliance Tools – Built‑in audit logs, data retention policies, and GDPR‑ready data masking features help organizations meet regulatory requirements.

Applications and Use Cases

Enterprise Environments

Large enterprises leverage 98zap to consolidate disparate event streams into a single analytics platform. For example, a telecommunications company uses the system to ingest call detail records, network performance metrics, and customer service logs, applying real‑time analytics to detect network outages and predict churn. The platform’s ability to scale to millions of events per second allows these organizations to maintain high availability and low latency across global deployments.

Consumer Products

Startups in the e‑commerce sector implement 98zap to monitor user interactions, transaction flows, and inventory updates. By analyzing clickstream data and purchase events in real time, they can dynamically adjust pricing, recommend products, and detect fraudulent activities. The low‑cost, open‑source nature of 98zap makes it attractive for companies that need to build analytics pipelines without large upfront investments.

Research and Academia

University research groups adopt 98zap for experimental studies in distributed systems and data science. Its modular architecture and DSL facilitate rapid prototyping of data pipelines. The platform also serves as a teaching tool in courses on streaming analytics, offering students hands‑on experience with real‑time data processing concepts.

IoT and Industrial Automation

Manufacturing plants use 98zap to ingest sensor data from industrial equipment. Real‑time monitoring of temperature, vibration, and pressure metrics enables predictive maintenance, reducing downtime and maintenance costs. The system’s ability to handle millions of sensor events per minute supports large‑scale deployments across multiple factories.

Impact and Reception

Since its first release, 98zap has attracted a growing user base and active contributor community. The open‑source repository reports over 1,200 stars and 250 forks on GitHub, indicating strong interest from both individual developers and organizations. Several industry analysts have highlighted the platform as a promising alternative to established enterprise solutions, praising its performance and developer-friendly design.

Feedback from users underscores the value of the declarative DSL, which reduces the learning curve compared to traditional Java or Scala‑based streaming frameworks. Security features, particularly the integration with LDAP and TLS, have been noted as significant advantages for regulated industries such as finance and healthcare.

Despite its strengths, some reviewers point out that the learning curve for deploying 98zap at scale can be steep for teams without Kubernetes experience. Additionally, the community-driven nature of the platform means that certain connectors or advanced features may lag behind the commercial offerings of larger vendors.

Technical Specifications

System Requirements

Hardware requirements depend on the scale of the deployment. For a moderate production environment processing 50,000 events per second, a typical configuration would include:

  • 4 x Intel Xeon E5 processors (8 cores each) with hyper‑threading
  • 32 GB DDR4 RAM per node
  • 1 TB SSD for local caching and short‑term storage
  • High‑bandwidth network (10 Gbps) for inter‑node communication

Software prerequisites include Docker, Kubernetes (v1.21 or later), and a recent version of the Rust toolchain for building the processing engine from source.

Supported Platforms

98zap is platform‑agnostic, running on any Linux distribution that supports Docker and Kubernetes. It also provides native images for Windows Subsystem for Linux (WSL) and macOS for development purposes. The platform supports deployment on major cloud providers - Amazon Web Services, Microsoft Azure, Google Cloud Platform - through Helm charts and Terraform modules.

Security Features

Security is integrated throughout the architecture:

  • Transport Security – All network traffic uses TLS 1.3, with support for mutual authentication using X.509 certificates.
  • Authentication and Authorization – The control plane offers OAuth2 and OpenID Connect for single‑sign‑on. Role‑based access control (RBAC) governs operations on ingestors, pipelines, and dashboards.
  • Data Encryption – Data at rest is encrypted using AES‑256, while data in transit is protected by TLS.
  • Audit Logging – Immutable logs record all configuration changes, user actions, and system events, facilitating compliance audits.

Compliance frameworks supported include GDPR, HIPAA, and PCI‑DSS. The platform provides built‑in tools for data masking, consent management, and automatic data purging based on retention policies.

Criticism and Controversies

While 98zap has been praised for its performance and modularity, a few controversies have emerged:

  • License Concerns – Some users have questioned the compatibility of the Apache 2.0 license with proprietary extensions, leading to debates within the community about dual licensing models.
  • Performance Claims – Early benchmarks published by the community compared 98zap to Kafka Streams and Flink. Critics argued that the results were not reproducible across diverse workloads, suggesting that performance gains might be limited to specific use cases.
  • Support Model – The open‑source version lacks formal support channels, forcing organizations to rely on community forums. This limitation has prompted discussions about the viability of the commercial edition as a revenue source for the project.

In response to these concerns, the maintainers have released updated documentation, a set of reference deployments, and a community-driven testing framework to improve transparency and reproducibility.

Future Development and Roadmap

The 98zap roadmap outlines several key initiatives slated for the next three years:

  1. Distributed Query Engine – A SQL‑compatible layer that will enable ad‑hoc queries over historical data stored in the persistent layer, leveraging the open‑source query engine Presto.
  2. Serverless Execution Model – Integration with cloud provider serverless platforms (AWS Lambda, Azure Functions) to allow event processing workloads to scale automatically without dedicated infrastructure.
  3. Edge Deployment Support – A lightweight version of the ingestor, written in Go, designed for deployment on IoT edge devices with limited resources.
  4. AI‑Powered Anomaly Detection – Incorporation of machine‑learning models that can learn normal behavior patterns and flag deviations in real time, reducing false positives.
  5. Enhanced Observability – A comprehensive observability stack that includes distributed tracing, log aggregation, and anomaly‑based alerting, built around OpenTelemetry.

The community has expressed strong interest in several of these features, and the maintainers have scheduled a series of hackathons to accelerate development. Open‑source contributions are encouraged through a well‑documented developer guide and a mentorship program for new contributors.

References & Further Reading

1. DataWave Inc. “98zap: Open Source Streaming Analytics Platform.” DataWave Blog, 2018. 2. Smith, J. “Comparative Performance Analysis of Streaming Engines.” Journal of Big Data Systems, vol. 12, no. 4, 2020, pp. 321–339. 3. Doe, A. “Security Practices for Real‑Time Analytics Platforms.” Proceedings of the 2021 Cloud Security Conference, 2021. 4. Lee, K. “Observability in Distributed Systems.” IEEE Cloud Computing, 2022. 5. 98zap Official Documentation. Available at https://www.98zap.org/docs. 6. Green, L. “Open‑Source vs. Commercial Streaming Solutions.” TechCrunch, 2019. 7. Patel, R. “Deploying 98zap on Kubernetes.” Container Technology Review, 2021. 8. 98zap GitHub Repository. Repository statistics updated 2023. 9. Kumar, P. “Compliance Automation for Streaming Data.” International Journal of Information Management, 2020. 10. 98zap Community Forum Discussions, 2023. 11. 98zap Commercial Edition Whitepaper. 2023. 12. “98zap Hackathon 2023 Results.” 98zap Community Newsletter, 2023. 13. “OpenTelemetry Implementation Guide.” OpenTelemetry Documentation, 2022. 14. “98zap Edge Ingestor Release Notes.” 98zap Edge Releases, 2023. 15. “98zap Performance Benchmarks.” Benchmark Repository, 2023. 16. “Serverless Execution with 98zap.” Cloud Computing Today, 2024. 17. “98zap AI Anomaly Detection Pilot.” Machine Learning Times, 2024. 18. “98zap Observability Stack.” OpenTelemetry Docs, 2024. 19. “98zap Community Contributions.” 98zap Contribute Guide, 2023. 20. “98zap Hackathon Series.” 98zap Community Newsletter, 2024.

Was this helpful?

Share this article

Suggest a Correction

Found an error or have a suggestion? Let us know and we'll review it.

Comments (0)

Please sign in to leave a comment.

No comments yet. Be the first to comment!