Sizing drivers and their impact

Use the information in this topic to calculate the deployment size of the product in your environment.

Notes

The following table describes some of the primary drivers that impact product performance, along with their approximate impact on sizing:

DriversConsiderations and impact

Retention period

An increase in the retention period impacts the memory and storage requirements for one or more Indexers deployed in your environment.

Impact on memory

The product stores data in time-based indices (buckets). The capacity of each bucket can be calculated in terms of either volume of data or time duration (in hours).

For each index, the minimum amount of RAM required by the Indexer is approximately 18 MB.

By default, each day, data is indexed into 4 buckets of 6 hours each. For a 7-day retention period (the default), you need 4 (buckets) * 7 (retention in days) = 28 buckets.

The total RAM requirement = 28 (buckets) * 18 (MB per bucket) = 504 MB.

  • RAM required (in MB) = Number of buckets required * 18

  • Number of buckets required = 4 (configurable) * Retention period (in days)

Impact on storage

For every 1-day increase in your retention period, you require an increase of 1.3 times the volume of incoming data (per day).

With a 7-day retention period (the default), if you are indexing 10 GB of data per day, your storage requirement can be calculated as 7 (retention in days) * 10 (incoming data in GB) * 1.3 = 91 GB

Storage required = Retention period (in days) * Volume of data (per day) * 1.3

Number of concurrent users

Multiple threads are created for every search that is triggered. The time taken by each thread is minimal and is approximately an average of 0.5 CPUs per user (for each search execution).

Each notification or dashlet execution is the equivalent of one search execution.

Impact on CPU

Parallel executions of search requests, dashlets, and notifications by concurrent users can impact the CPU utilization of your system. This factor is applicable to one or more Indexers and one or more Search components.

CPU required = (Number of notification executions + Number of dashlet executions + Number of search executions) * 0.5

Impact on memory

For every search that you execute, a certain amount of data is loaded to the memory, especially for fields (on the Filters panel on the Search tab) and advanced search commands (for example, group and stats). The memory impact is higher if the fields involved are high cardinality fields with many unique values. The Java heap size requirement increases in proportion to the number of searches performed by concurrent users.

For example, eight concurrent searches running field calculations for a high cardinality field against the same index at the same time will most likely require the same Java heap size as that of one field calculation. On the other hand, eight concurrent searches running field calculations for a high cardinality field against different indices at the same time requires around eight times the Java heap size of one field calculation.

No simple formula exists for calculating the impact of the number of concurrent users on memory. You can allocate an additional 100 MB RAM per concurrent user, this additional RAM is required for the Indexer component.

Volume of data indexed (per day)

The volume of data indexed per day can impact your memory and storage requirements for one or more Indexers deployed in your environment.

Impact on memory

The product stores data in time-based indices (buckets). The capacity of each bucket can be calculated in terms of either volume of data or time duration (in hours). Every search performed has an impact on the memory of the Indexer. The bucket capacity plays an important role in helping you control the memory needs.

No index can contain more than 250 GB of data. If you want to index 1 TB GB of data per day, then 1000 / 250 = 4 buckets per day.

Recommendation

BMC recommends that you use different index blocks so that the overall data is distributed in different indexes per index block per day.

Therefore, the indexed duration (in hours) is 24 / 4 = 6 hours. This value must be used as the value for the intervalInHrs property. For more information, see Component configuration recommendations.

Number of buckets required (per day) =  Volume of data (per day) / 5

Indexed duration (in hours) = 24 / Number of buckets required (per day)

Impact on storage

The storage required for your system is 1.34 times the volume of data (indexed per day) depending on the defined data-retention period.

Storage required =  1.34 * Volume of data (per day) * Retention period (in days)

Anomaly detection

Anomaly detection can impact your memory and Indexer CPU utilization.

Considerations and impact

A large number of target hosts are impacted.

Impact on memory

If you have greater than 2,000 target hosts, then consider increasing your server console/search Java Heap settings. Use the following formula to calculate the additional amount of Java Heap that will be required to be added to the custom/config/services/server.conf file:

Additional amount of Java Heap to be added to the custom/config/services/server.conf file = (100 KB * (number of hosts – 2,000) ) / number of search servers

Impact on Indexer CPU utilization

The Indexer CPU utilization can more than double while the background anomaly baseline job is running. The total number of hosts directly impacts how long the anomaly job takes to run. Additional search servers can help reduce the time required for the anomaly baseline job to complete, but at the same time will require higher indexer CPU utilization to service each additional server.

Archiving

Archiving can impact the number of Indexer nodes and storage requirement.

Considerations and impact

Number of Archive nodes should be the same or more than the number of Live nodes.

Impact of storage

Archive disk storage requirement=1.3 times size of the original data you want to archive. So, If you want to archive one year's worth of data and are indexing 100 GB per day, then the disk archive storage space= 100 GB/day * 365 days * 1 3.

Archive disk storage requirement= (Amount of data to be archived per day) * 365 * 1.3

Was this page helpful? Yes No Submitting... Thank you

Comments