Hardware requirements [Документация VAS Experts]

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
en:dpi:qoe_analytics:implementation_administration:requirements [2025/12/08 11:56] elena.krasnobryzhen:dpi:qoe_analytics:implementation_administration:requirements [2025/12/09 08:49] (current) – [Table] elena.krasnobryzh
Line 25: Line 25:
  
 =====Detailed recommendations===== =====Detailed recommendations=====
-| CPU                         | **Single processor** supporting **SSE 4.2** instructions starting from [[http://en.wikipedia.org/wiki/Nehalem_(microarchitecture)|Intel Nehalem]] and [[https://en.wikipedia.org/wiki/Zen_2|AMD EPYC Zen2]] **with 4 or more cores** and a **base clock speed of 2.5 GHz or higher**. Choose CPUs with more cores. Clock speed is less important. For example, 16 cores at 2600 MHz is better than 8 cores at 3600 MHz.\\ \\ **Do not disable Hyper-threading and Turbo-Boost**. |   +| CPU                     | **Single processor** supporting **SSE 4.2** instructions starting from [[http://en.wikipedia.org/wiki/Nehalem_(microarchitecture)|Intel Nehalem]] and [[https://en.wikipedia.org/wiki/Zen_2|AMD EPYC Zen2]] **with 4 or more cores** and a **base clock speed of 2.5 GHz or higher**. Choose CPUs with more cores. Clock speed is less important. For example, 16 cores at 2600 MHz is better than 8 cores at 3600 MHz.\\ \\ **Do not disable Hyper-threading and Turbo-Boost**.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                
-| RAM                         | From 16 GB; memory modules must be installed **in all CPU channels** on the motherboard. The memory size should not be less than the volume of queried data. The more memory, the better performance when generating reports and the lower the disk load.\\ \\ **Always disable swap file**. |   +| RAM                     | From 16 GB; memory modules must be installed **in all CPU channels** on the motherboard. The memory size should not be less than the volume of queried data. The more memory, the better performance when generating reports and the lower the disk load.\\ \\ **Always disable swap file**.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    
-| Disks                       | To optimize storage cost, multiple types of disks are used:\\ 1. **default** — fast disks for data ingestion and aggregation processes, SSD NVMe in RAID0 recommended.\\ 2. **hot** — disks for storing data likely to be queried (usually up to 3 months), SSDs in RAID-10, RAID-5, RAID-6, or RAID-50.\\ 3. **cold** — high-capacity slow disks for long-term storage, HDDs in RAID-10, RAID-5, RAID-6, or RAID-50 recommended.\\ Retention period at each level is configured via GUI. Data migration and cleanup occur automatically according to settings. A mechanism for overflow protection is also provided. The main data volume is stored in /var/lib/clickhouse. Temporary data (IPFIX dumps) are stored in /var/qoestor/backend/dump. For best performance, these directories should be located on a separate disk or array. See [[en:dpi:qoe_analytics:implementation_administration:configuration_setup:disc]].\\ For OS and QoE Stor software installation, use two drives of at least 256GB combined in RAID1 (mirror). A hardware RAID controller is required. |   +| Disks                   **File system type: ext4.**\\ To optimize storage cost, multiple types of disks are used:\\ 1. **default** — fast disks for data ingestion and aggregation processes, SSD NVMe in RAID0 recommended.\\ 2. **hot** — disks for storing data likely to be queried (usually up to 3 months), SSDs in RAID-10, RAID-5, RAID-6, or RAID-50.\\ 3. **cold** — high-capacity slow disks for long-term storage, HDDs in RAID-10, RAID-5, RAID-6, or RAID-50 recommended.\\ Retention period at each level is configured via GUI. Data migration and cleanup occur automatically according to settings. A mechanism for overflow protection is also provided. The main data volume is stored in /var/lib/clickhouse. Temporary data (IPFIX dumps) are stored in /var/qoestor/backend/dump. For best performance, these directories should be located on a separate disk or array. See [[en:dpi:qoe_analytics:implementation_administration:configuration_setup:disc]].\\ For OS and QoE Stor software installation, use two drives of at least 256GB combined in RAID1 (mirror). A hardware RAID controller is required.  
-| QoE Cluster (Sharding)       | It is better to create several nodes and combine them into a cluster:\\ GUI can optimize queries so that all nodes build reports in parallel.\\ [[en:dpi:dpi_components:ipfix_balancer]] is used for even data distribution across nodes (round-robin), significantly improving performance.\\ If a node fails, the balancer automatically directs data to remaining nodes. General recommendation: more nodes and smaller data portions per node. This ensures:\\ 1. High performance\\ 2. Fault tolerance\\ 3. Scalability (by adding nodes to the cluster) |   |+| QoE Cluster (Sharding)  | It is better to create several nodes and combine them into a cluster:\\ GUI can optimize queries so that all nodes build reports in parallel.\\ [[en:dpi:dpi_components:ipfix_balancer]] is used for even data distribution across nodes (round-robin), significantly improving performance.\\ If a node fails, the balancer automatically directs data to remaining nodes. General recommendation: more nodes and smaller data portions per node. This ensures:\\ 1. High performance\\ 2. Fault tolerance\\ 3. Scalability (by adding nodes to the cluster)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |
  
 =====Operation tips from Yandex ClickHouse===== =====Operation tips from Yandex ClickHouse=====
 You can read operation tips from Yandex ClickHouse at https://clickhouse.com/docs/operations/tips. You can read operation tips from Yandex ClickHouse at https://clickhouse.com/docs/operations/tips.