
Diskover

Intelligent Data Management and Analytics

[Image: Diskover diagram]

Diskover is an open-source data management and analytics tool that helps organizations manage and visualize their file systems. By showing how data and storage are actually used, it supports informed decisions about data retention, archiving, and resource allocation.

Key Features

  • Comprehensive File Indexing: Quickly scan and index file systems to gather detailed metadata about files and directories.
  • Data Visualization: Utilize a range of visual tools to analyze storage usage, identify large files, duplicates, and areas of inefficiency.
  • Customizable Reports: Generate detailed reports tailored to your specific needs, helping you understand data trends and optimize storage.
  • Scalable and Fast: Designed to handle large-scale file systems, Diskover can efficiently process and analyze vast amounts of data.
  • Integration with Elasticsearch: Indexed metadata is stored in Elasticsearch, which provides fast, scalable search across your indexed data.

Getting Started

To get started with Diskover, visit the official GitHub repository for installation instructions, configuration guides, and additional resources. The repository provides comprehensive documentation to help you deploy and customize Diskover to suit your data management needs.

Community and Support

Join the Diskover community on GitHub to engage with other users, share feedback, and contribute to the project. Stay informed about the latest features, improvements, and community-driven enhancements.

Optimize your data management strategy with Diskover, a tool designed to give you clarity and control over your file systems.

Volumes

/nfs-nas-swarm/config/
/nfs-nas-swarm/data/

Deployment

No special requirements.

Docker Swarm file

version: '3.3'
services:
  diskover:
    image: lscr.io/linuxserver/diskover:latest
    # Optional command override, left disabled:
    # command:
    #   - python3 /app/diskover/diskover.py -f -i diskover-nfs-nas-swarm /nfs-nas-swarm
    #   - /bin/bash
    #   - -c
    #   - crontab /config/crontabs/abc
    environment:
      ES_HOST: elasticsearch
      ES_PASS: changeme
      ES_PORT: '9200'
      ES_USER: elastic
      PGID: '1000'
      PUID: '1000'
    volumes:
     - /swarm/config/diskover/diskover:/config
     - /nfs-nas-swarm:/nfs-nas-swarm
     - /media_pool:/media_pool
     - /media_services_config:/media_services_config
    networks:
     - internal
     - traefik-public
    logging:
      driver: json-file
    deploy:
      labels:
        traefik.http.routers.diskover-https.entrypoints: https
        traefik.http.routers.diskover-https.tls.certresolver: le
        traefik.http.routers.diskover-http.middlewares: https-redirect
        traefik.http.services.diskover.loadbalancer.server.port: '80'
        traefik.http.routers.diskover-https.rule: Host(`diskover.vnerd.nl`)
        traefik.constraint-label: traefik-public
        traefik.http.routers.diskover-https.tls: 'true'
        traefik.http.routers.diskover-http.entrypoints: http
        traefik.http.routers.diskover-https.middlewares: authelia
        traefik.docker.network: traefik-public
        traefik.enable: 'true'
        traefik.http.routers.diskover-http.rule: Host(`diskover.vnerd.nl`)
      placement:
        constraints:
         - node.labels.Arch!=i686
         - node.labels.worker==enabled
  elasticsearch:
    image: docker.elastic.co/elasticsearch/elasticsearch:7.10.2
    environment:
      ES_JAVA_OPTS: -Xms1g -Xmx1g
      PGID: '1000'
      PUID: '1000'
      bootstrap.memory_lock: 'true'
      discovery.type: single-node
      xpack.security.enabled: 'false'
    ports:
     - 9200:9200
    volumes:
     - /swarm/data/diskover/esdata:/usr/share/elasticsearch/data
    networks:
     - internal
    logging:
      driver: json-file
    deploy:
      replicas: 0
      placement:
        constraints:
         - node.labels.Arch!=i686
         - node.labels.worker==enabled
  elasticsearch-helper:
    image: alpine:latest
    command:
     - sh
     - -c
     - sysctl -w vm.max_map_count=262144
    environment:
      PGID: '1000'
      PUID: '1000'
    networks:
     - internal
    logging:
      driver: json-file
    deploy:
      replicas: 0
      placement:
        constraints:
         - node.labels.Arch!=i686
         - node.labels.worker==enabled
networks:
  internal:
    driver: overlay
  traefik-public:
    external: true
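
Note that the file above sets `replicas: 0` for both `elasticsearch` and `elasticsearch-helper`, so deploying it as-is starts only the `diskover` service. If Elasticsearch is meant to run inside this stack rather than somewhere else, one option is a small override file passed as a second `-c` argument to `docker stack deploy`. The sketch below is an assumption about intent, not part of the original setup, and the file name `diskover-es-override.yml` is hypothetical.

```yaml
# diskover-es-override.yml (hypothetical file name)
# Assumption: Elasticsearch should run inside this stack.
# The base file disables it with replicas: 0; this override raises it to 1.
version: '3.3'
services:
  elasticsearch:
    deploy:
      replicas: 1   # run a single Elasticsearch task
```

When both files are given, values from the later file take precedence, so the image, volumes, and placement constraints from the base file should stay as defined there.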

Notes
