11# Major Enhancements
22
3- ## qconf support to add/modify/delete/show complex entries individually
3+ ## v9.0.1beta
4+
5+ ### Utilization of additional data stores and activation of new thread pools
6+
7+ Beginning with patch v9.0.1 the new internal architecture of ` sge_qmaster ` is activated so that the component can
8+ utilize additional data stores by starting new thread pools.
9+
10+ * Listener thread pool: The listener thread pool was already activated in v9.0.0. It is used to handle incoming
11+ requests from clients and to distribute them to the corresponding processing components. Additionally, this pool
12+ of threads utilizes a new data store to answer authentication requests. Beginning with v9.0.1 this data store
13+ is used for even more requests to relieve other internal components within ` sge_qmaster ` .
14+
15+ * Reader thread pool: The reader thread pool is activated and can now utilize a corresponding thread pool.
16+ This will boost the performance of clusters in large environments where also users tend to request the status of the
17+ system very often, by using client commands like ` qstat ` , ` qhost ` or other commands that send read-only requests
18+ to ` sge_qmaster ` . The additional data store needs to be enabled manually by setting following qmaster parameter in the
19+ * qmaster_params* of the cluster configuration:
20+
21+ ```
22+ > qconf -mconf
23+ ...
24+ qmaster_params ...,DISABLE_SECONDARY_DS_READER=false
25+ ...
26+ ```
27+
28+ Please note that requests answered by the reader thread pool might deliver slightly outdated data compared to the
29+ requests answered with data from the main data store because both data stores can be slightly out of sync. The
30+ maximum deviation can be configured by setting the ` MAX_DS_DEVIATION ` in milliseconds within in the ` qmaster_params ` .
31+
32+ ```
33+ > qconf -mconf
34+ ...
35+ qmaster_params ...,MAX_DS_DEVIATION=1000
36+ ...
37+ ```
38+
39+ The default value is 1000 milliseconds. The value should be chosen carefully to balance the performance gain with
40+ the accuracy of the data.
41+
42+ With one of the upcoming patches we will introduce an addition concept of automatic-sessions that will allow to
43+ synchronize the data stores more efficiently and so that client commands can be enforced to get the most recent data.
44+
45+ * Enhanced monitoring: The monitoring of ` sge_qmaster ` has been enhanced to provide more detailed information about
46+ the utilization of the different thread pools. As also in the past the monitoring is enabled by setting the monitor
47+ time:
48+
49+ ```
50+ > qconf -mconf
51+ ...
52+ qmaster_params ...,MONITOR_TIME=10
53+ ...
54+ ```
55+
56+ ` qping ` will then show statistics about the handled requests per thread.
57+
58+ ```
59+ qping -i 1 -f <master_host> $SGE_QMASTER_PORT qmaster 1
60+ ...
61+ 10/11/2024 12:54:53 | reader: runs: 261.04r/s (GDI (a:0.00,g:2871.45,m:0.00,d:0.00,c:0.00,t:0.00,p:0.00)/s
62+ OTHER (ql:0))
63+ out: 261.04m/s APT: 0.0007s/m idle: 80.88% wait: 0.01% time: 9.99s
64+ 10/11/2024 12:54:53 | reader: runs: 279.50r/s (GDI (a:0.00,g:3074.50,m:0.00,d:0.00,c:0.00,t:0.00,p:0.00)/s
65+ OTHER (ql:0))
66+ out: 279.50m/s APT: 0.0007s/m idle: 79.08% wait: 0.01% time: 10.00s
67+ 10/11/2024 12:54:53 | listener: runs: 268.65r/s (in (g:268.34 a:0.00 e:0.00 r:0.30)/s
68+ GDI (g:0.00,t:0.00,p:0.00)/s)
69+ out: 0.00m/s APT: 0.0001s/m idle: 98.42% wait: 0.00% time: 9.99s
70+ 10/11/2024 12:54:53 | listener: runs: 255.37r/s (in (g:255.37 a:0.00 e:0.00 r:0.00)/s GDI (g:0.00,t:0.00,p:0.00)/s)
71+ out: 0.00m/s APT: 0.0001s/m idle: 98.54% wait: 0.00% time: 10.00s
72+ ```
73+
74+ ## v9.0.0
75+
76+ ### qconf support to add/modify/delete/show complex entries individually
477
578` Qconf ` also allows you to add, modify, delete and display complexes individually using the new ` -ace ` , ` -Ace ` ,
679` -mce ` , ` -Mce ` , ` -sce ` and ` -scel ` switches. Previously this was only possible as a group command for the whole
780complex set with ` -mq ` . More information can be found in the qconf(1) man page or by running ` qconf -help ` .
881
982(Available in Open Cluster Scheduler and Gridware Cluster Scheduler)
1083
11- ## Added support to supplementary group IDs in user, operator and manager lists.
84+ ### Added support to supplementary group IDs in user, operator and manager lists.
1285
1386Additionally, to user and primary group names, it is now possible to specify supplementary group IDs in user, operator,
1487and manager lists. User lists can be specified in host, queue, configuration, and parallel environment objects to allow
@@ -30,7 +103,7 @@ in a production environment. Enabling caching services like `nscd` can help redu
30103
31104(Available in Gridware Cluster Scheduler only)
32105
33- ## New internal architecture to support multiple Data Stores
106+ ### New internal architecture to support multiple Data Stores
34107
35108The internal data architecture of ` sge_qmaster ` has been changed to support multiple data stores. This change
36109does not have a major impact currently and is not visible to the user. However, it is a prerequisite for future
@@ -40,7 +113,7 @@ enhancing the performance of the cluster in large environments.
40113
41114(Available in Open Cluster Scheduler and Gridware Cluster Scheduler)
42115
43- ## New RSMAP (Resource Map) complex type
116+ ### New RSMAP (Resource Map) complex type
44117
45118Resource Maps are a new complex type that allows administrators to define a list of special resources which
46119are available on a host, e.g. GPU devices, networking devices, lists of network ports, or other special resources.
@@ -111,7 +184,7 @@ qrsh -l port_numbers=8 env | grep SGE_HGR
111184SGE_HGR_port_number=65000 65001 65002 65003 65004 65005 65006 65007
112185```
113186
114- ## Per HOST complex variables
187+ ### Per HOST complex variables
115188
116189The definition of complex variables contains the attribute ` consumable ` which could so far have the following values:
117190
@@ -125,7 +198,7 @@ of parallel jobs multiple tasks are running on the same host, the requested amou
125198once. E.g. multiple tasks of a parallel job can share the same GPU.
126199
127200
128- ## One-line JSON format for accounting and reporting files
201+ ### One-line JSON format for accounting and reporting files
129202
130203The accounting and reporting files contain one line per record.
131204The format of the records used to be a column-based format with a fixed number of columns,
@@ -202,7 +275,7 @@ as extensions to the accounting and reporting records (e.g. more exact timestamp
202275additional usage values like maxrss) are only done in the new format.
203276
204277
205- ## Resource and queue requests per scope (global, master, slave) for parallel jobs
278+ ### Resource and queue requests per scope (global, master, slave) for parallel jobs
206279
207280In former product versions resource requests for parallel jobs were only possible on the global level.
208281Resource requests for parallel jobs were applied to all tasks of the job, both master task (the job script)
0 commit comments