[Kubernetes] Loki Distributed

Posted Jun 19, 2024 Updated Jan 13, 2025

By KyungRyeol

4 min read

Helm 설치 및 설명 참고

Install the Loki Distributed Helm charts

  
helm repo add grafana https://grafana.github.io/helm-charts
helm repo update
helm install loki-distributed grafana/loki-distributed --namespace [NAMESPACE NAME] --version [VERSION]

Loki Distributed - Helm 설치 참고

Customize Default Configuration

values.yaml 수정
최상위 values.yaml을 수정하면 하위 폴더 values.yaml을 override 한다.
- Chart
  - https://github.com/grafana/helm-charts/tree/main/charts/loki-distributed
- Release file (.tgz)
  - https://github.com/grafana/helm-charts/releases

Loki Configurations

Loki 설정 값 문서

Grafana Loki 모범 사례
- 레이블은 Loki에서 Log를 검색할 때 필터링이 되는 기준이 되는데 여기선 정적 레이블을 가급적이면 사용하라고 가이드하고 있다.
  - 레이블은 클라이언트에서 설정한 다음에 Loki에게 Push하면 된다.
- chunk_target_size를 사용하라고 한다.
  - 기본 1.5MB 사이즈로 모든 청크 크기를 채우도록 하는게 좋다고 한다.
- chunk_encoding은 기본값이 gzip인데 snappy를 권고한다고 한다.
  - 이게 훨씬 더 압축을 푸는데도 빠르고 쿼리 속도도 더 빠르다고 한다.
- max_chunk_age는 2h을 권고한다고 한다.
- chunk_idle_period은 1h ~ 2h을 권고한다고 한다.
- RF(Replication factor)를 항상 설정할 것을 권고한다.
  - 데이티의 손실 가능성을 완화하기 위해 Ingester의 복제 요소를 일반적으로 3개로 설정할 것을 권고하고 있다.
  - 복제 요소가 데이터 손실을 방지하는 유일한 요소는 아니며, 주요 목적은 롤아웃 및 재시작 중에 쓰기가 중단되지 않도록 하는 것이다.

참고
https://grafana.com/docs/loki/latest/best-practices/
https://grafana.com/blog/2021/02/16/the-essential-config-settings-you-should-use-so-you-wont-drop-logs-in-loki/

Request Validation, Rate-Limit 에러
- 아래의 값으로 설정하면 필히 쓰로틀링이 걸리게 된다. 하여 적절한 값으로 조정이 필요하다.
- ingestion_rate_mb는 기본값 4이며 ingestion_burst_size_mb는 기본값 6이다.
  - ingestion_rate_mb: 20
  - ingestion_burst_size_mb: 40

참고
https://grafana.com/docs/loki/latest/operations/request-validation-rate-limits/
https://grafana.com/docs/loki/latest/configuration/#limits_config

Loki Grafana 모니터링
- Loki는 기본적으로 /metrics 엔드포인트로부터 각 Components들의 Metric들을 확인할 수 있다.
- 모든 컴포넌트들의 Service annotation에 아래의 문구를 추가해준다.
  - 이렇게 하면 Prometheus가 자동으로 /metrics 엔드포인트로 메트릭들을 scrape 해간다.
    1 2 3 prometheus.io/scrape: "true" prometheus.io/path: "/metrics" prometheus.io/port: "3100"

Loki에 대한 모니터링 참고

analytics false

  
analytics:
  reporting_enabled: false

Compactor와 Table Manager

Grafana Loki의 로그 보존(Retention)은 Compactor 혹은 Table Manager에 의해 수행된다.
현재 Table Manager를 통한 Retention은 TTL을 통해 달성되며 boltdb-shipper, chunk/index store 모두 작동한다.
Compactor를 통한 Retention은 boltdb-shipper 저장소에서만 지원된다.
만약 Compactor로 Retention을 적용한다면 Table Manager는 필요로 하지 않게 될 수 있다.

Compactor 설정 예시

  
compactor:
  retention_delete_delay: 2h
  delete_request_store: s3
  working_directory: "/var/loki/compactor"
  retention_enabled: true
  retention_delete_worker_count: 150
  apply_retention_interval: 1h
  compaction_interval: 5m

Compactor의 Retention은 limits_config에 설정해주면 된다.
Retention 설정 참고

storage

  
schema_config:
  configs:
    - from: "2024-06-19"
      index:
        period: 24h
        prefix: tsdb_index_
      object_store: s3
      schema: v13
      store: tsdb

storage_config:
  aws:
    s3: http://access_key:secret_key@endpoint:port
    bucketnames: loki-logs
    s3forcepathstyle: true
  tsdb_shipper:
    active_index_directory: /var/loki/tsdb-index
    cache_location: /var/loki/tsdb-cache
    index_gateway_client:
      # only applicable if using microservices where index-gateways are independently deployed.
      # This example is using kubernetes-style naming.
      server_address: dns:///index-gateway.<namespace>.svc.cluster.local:9095

minio 설치 참고

Install Customize Default Configuration

  
helm install [RELEASE NAME] [Chart.yaml 경로] -f [YAML 파일 또는 URL에 값 지정 (여러 개를 지정가능)] -n [NAMESPACE NAME]

  
helm install loki-distributed grafana/loki-distributed -f override-values.yaml -n [NAMESPACE NAME]

Uninstall the Chart

  
helm uninstall [RELEASE NAME] -n [NAMESPACE NAME]

Kubernetes, Grafana

This post is licensed under CC BY 4.0 by the author.

Install the Loki Distributed Helm charts

Customize Default Configuration

Loki Configurations

analytics false

Compactor와 Table Manager

storage

Install Customize Default Configuration

Uninstall the Chart

Trending Tags