Log Aggregation — Centralising Logs

Q: Centralise multi-region?

Ingestion in nearest region + async replication. Or separate stores + federated search (Loki federation, CloudWatch cross-account).

Igor Verentsov

What is Log Aggregation

By Igor Verentsov · Updated May 23, 2026

Key idea:

Log aggregation — practice of collecting logs from multiple services into a central searchable store. Reason: grepping across 50 servers doesn't scale. Stack options: ELK (Elasticsearch + Logstash + Kibana) — powerful but expensive, Loki (Grafana, cheaper), Splunk (enterprise $$$), CloudWatch/DataDog Logs (SaaS). Critical features: search, alerts, retention, correlation with traces.

Below: details, example, related terms, FAQ.

Free online tool — HTTP header checker: instant results, no signup.

Check your site →

Details

Collector (node level): Filebeat, Fluent Bit, Vector, Promtail
Pipeline: parsing (JSON, multiline), enrichment (host, trace_id), routing
Storage: Elasticsearch (indexed, $$), Loki (chunks, $), S3 + Athena (archive, cheapest)
Retention: hot (7d, fast) + warm (30d, slower) + cold (1y+, S3)
Cost volatility: DEBUG logs in prod → 10× spend. Log level discipline critical

Example

# Fluent Bit config
[INPUT]
    Name tail
    Path /var/log/nginx/access.log

[OUTPUT]
    Name loki
    Host grafana-loki:3100
    Labels host=$HOSTNAME,service=nginx

Related Terms

Understanding Log Aggregation Architecture

Log aggregation architecture involves several components that work together to collect, transform, and store logs from various sources. At its core, the architecture typically consists of:

Log Sources: These are the systems generating logs, including servers, applications, and network devices. Each source can produce different log formats, such as JSON, text, or XML.
Log Shippers: Tools like Filebeat or Fluentd are used to forward logs from the source to the log aggregation system. They can parse, filter, and enrich logs before sending.
Log Processing: This involves transforming logs into a common format. Tools like Logstash can be configured to perform parsing, filtering, and enrichment using pipelines.
Central Storage: The processed logs are stored in a centralized database or storage solution, such as Elasticsearch or Amazon S3, allowing for efficient querying and retrieval.
Visualization and Analysis: Front-end tools like Kibana or Grafana provide dashboards and visualizations, making it easier to analyze log data and identify trends or anomalies.

This architecture allows organizations to manage logs at scale, enabling faster troubleshooting, compliance reporting, and security monitoring.

Practical Examples of Log Aggregation Configurations

Implementing log aggregation can vary based on the chosen stack. Here are practical examples using two popular log aggregation solutions: ELK Stack and Grafana Loki.

ELK Stack Example

To configure Logstash to aggregate logs from a web server, you can use the following logstash.conf configuration:

input {
  file {
    path => "/var/log/nginx/access.log"
    start_position => "beginning"
  }
}

filter {
  grok {
    match => {"message" => "%{IPORHOST:client} %{WORD:method} %{URIPATHPARAM:request} %{NUMBER:response} %{NUMBER:bytes}"}
  }
}

output {
  elasticsearch {
    hosts => ["http://localhost:9200"]
    index => "nginx-access-%{+YYYY.MM.dd}"
  }
}

Grafana Loki Example

For Grafana Loki, you can configure Promtail to scrape logs from a directory. Here’s a sample promtail.yaml configuration:

server:
  http:
    port: 9080

positions:
  filename: /tmp/positions.yaml

clients:
  - url: http://localhost:3100/loki/api/v1/push

scrape_configs:
  - job_name: nginx
    static_configs:
      - targets:
          - localhost
        labels:
          job: nginx
          __path__: /var/log/nginx/*.log

These configurations illustrate how to set up log aggregation for different environments, allowing for efficient log collection and analysis.

Benefits of Centralizing Logs in Modern IT Environments

Centralizing logs offers numerous advantages in modern IT environments, particularly as systems become more complex and distributed. Here are some key benefits:

Improved Troubleshooting: With a centralized log repository, developers and operations teams can quickly search and correlate logs from different services, drastically reducing the time spent diagnosing issues.
Enhanced Security Monitoring: Centralized logs allow for better detection of anomalies and security incidents. By correlating logs from various sources, security teams can identify suspicious patterns and respond more effectively.
Compliance and Auditing: Many regulatory frameworks require organizations to maintain detailed logs for auditing purposes. Centralized log storage simplifies compliance by ensuring logs are retained in a secure and accessible manner.
Scalability: As your infrastructure grows, managing logs individually becomes unfeasible. Centralizing logs allows for scalable storage solutions that can handle increased log volumes without significant overhead.
Cost Efficiency: While initial setup costs may be higher, centralizing logs can lead to long-term savings by reducing the time spent on manual log management and improving operational efficiency.

By centralizing logs, organizations can leverage the full power of their log data, enabling proactive monitoring and faster response times in critical situations.

Learn more

How-to

Glossary

What is CDC (Change Data Capture)

Research

Frequently Asked Questions

ELK vs Loki?

ELK: full-text indexed, fast search, expensive at scale. Loki: Prometheus-like labels + grep at query time, 10× cheaper. For high volume — Loki. For complex search — ELK.

Cost control?

Sampling (drop 90% INFO logs), log level discipline (INFO/WARN/ERROR not DEBUG in prod), TTL (< 30 days hot).

Centralise multi-region?