
Commit 187cc09

EH: adapted release notes got v9.0.1beta
1 parent e3a5ba7 commit 187cc09

1 file changed

+80
-7
lines changed

doc/markdown/manual/release-notes/03_major_enhancements.md

Lines changed: 80 additions & 7 deletions
@@ -1,14 +1,87 @@
11
# Major Enhancements
22

3-
## qconf support to add/modify/delete/show complex entries individually
3+
## v9.0.1beta
4+
5+
### Utilization of additional data stores and activation of new thread pools
6+
7+
Beginning with patch v9.0.1, the new internal architecture of `sge_qmaster` is activated so that the component can
8+
utilize additional data stores by starting new thread pools.
9+
10+
* Listener thread pool: The listener thread pool was already activated in v9.0.0. It is used to handle incoming
11+
requests from clients and to distribute them to the corresponding processing components. Additionally, this pool
12+
of threads utilizes a new data store to answer authentication requests. Beginning with v9.0.1 this data store
13+
is used for even more requests to relieve other internal components within `sge_qmaster`.
14+
15+
* Reader thread pool: The reader thread pool is activated and can now utilize a corresponding data store.
16+
This will boost the performance of clusters in large environments where users also tend to request the status of the
17+
system very often, by using client commands like `qstat`, `qhost` or other commands that send read-only requests
18+
to `sge_qmaster`. The additional data store needs to be enabled manually by setting the following qmaster parameter in the
19+
*qmaster_params* of the cluster configuration:
20+
21+
```
22+
> qconf -mconf
23+
...
24+
qmaster_params ...,DISABLE_SECONDARY_DS_READER=false
25+
...
26+
```
27+
28+
Please note that requests answered by the reader thread pool might deliver slightly outdated data compared to the
29+
requests answered with data from the main data store because both data stores can be slightly out of sync. The
30+
maximum deviation can be configured by setting `MAX_DS_DEVIATION` in milliseconds within the `qmaster_params`.
31+
32+
```
33+
> qconf -mconf
34+
...
35+
qmaster_params ...,MAX_DS_DEVIATION=1000
36+
...
37+
```
38+
39+
The default value is 1000 milliseconds. The value should be chosen carefully to balance the performance gain with
40+
the accuracy of the data.
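
As a rough illustration of how the `qmaster_params` line shown above is composed, the following Python sketch sets or replaces a single entry in a configuration text. The function name `set_qmaster_param` is hypothetical and not part of the product. The sketch assumes that parameters are comma-separated `NAME=value` pairs and that the literal `none` stands for an empty list. It only manipulates text and does not call `qconf`.

```python
def set_qmaster_param(config_text, name, value):
    """Return config_text with NAME=value set in its qmaster_params line.

    Hypothetical helper: assumes comma-separated NAME=value pairs and
    treats the literal "none" as an empty parameter list.
    """
    out = []
    found = False
    for line in config_text.splitlines():
        parts = line.split(None, 1)
        if parts and parts[0] == "qmaster_params":
            found = True
            current = parts[1] if len(parts) > 1 else ""
            params = [p.strip() for p in current.split(",")]
            # Drop empty entries and the "none" placeholder.
            params = [p for p in params if p and p.lower() != "none"]
            # Remove an existing setting of the same parameter, if any.
            params = [p for p in params if p.split("=", 1)[0] != name]
            params.append(f"{name}={value}")
            line = "qmaster_params " + ",".join(params)
        out.append(line)
    if not found:
        # No qmaster_params line present: append one.
        out.append(f"qmaster_params {name}={value}")
    return "\n".join(out)


if __name__ == "__main__":
    cfg = "qmaster_params MONITOR_TIME=10,MAX_DS_DEVIATION=500"
    print(set_qmaster_param(cfg, "MAX_DS_DEVIATION", "1000"))
```

Existing entries are preserved and only the named parameter is replaced, so enabling the reader data store and tuning the deviation can be done in two independent calls.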
41+
42+
With one of the upcoming patches we will introduce the additional concept of automatic sessions, which will allow the
43+
data stores to be synchronized more efficiently and client commands to be forced to retrieve the most recent data.
44+
45+
* Enhanced monitoring: The monitoring of `sge_qmaster` has been enhanced to provide more detailed information about
46+
the utilization of the different thread pools. As in the past, monitoring is enabled by setting the monitor
47+
time:
48+
49+
```
50+
> qconf -mconf
51+
...
52+
qmaster_params ...,MONITOR_TIME=10
53+
...
54+
```
55+
56+
`qping` will then show statistics about the handled requests per thread.
57+
58+
```
59+
qping -i 1 -f <master_host> $SGE_QMASTER_PORT qmaster 1
60+
...
61+
10/11/2024 12:54:53 | reader: runs: 261.04r/s (GDI (a:0.00,g:2871.45,m:0.00,d:0.00,c:0.00,t:0.00,p:0.00)/s
62+
OTHER (ql:0))
63+
out: 261.04m/s APT: 0.0007s/m idle: 80.88% wait: 0.01% time: 9.99s
64+
10/11/2024 12:54:53 | reader: runs: 279.50r/s (GDI (a:0.00,g:3074.50,m:0.00,d:0.00,c:0.00,t:0.00,p:0.00)/s
65+
OTHER (ql:0))
66+
out: 279.50m/s APT: 0.0007s/m idle: 79.08% wait: 0.01% time: 10.00s
67+
10/11/2024 12:54:53 | listener: runs: 268.65r/s (in (g:268.34 a:0.00 e:0.00 r:0.30)/s
68+
GDI (g:0.00,t:0.00,p:0.00)/s)
69+
out: 0.00m/s APT: 0.0001s/m idle: 98.42% wait: 0.00% time: 9.99s
70+
10/11/2024 12:54:53 | listener: runs: 255.37r/s (in (g:255.37 a:0.00 e:0.00 r:0.00)/s GDI (g:0.00,t:0.00,p:0.00)/s)
71+
out: 0.00m/s APT: 0.0001s/m idle: 98.54% wait: 0.00% time: 10.00s
72+
```
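
The `idle` and `wait` percentages in this output lend themselves to simple scripted checks. The following Python sketch extracts a few fields from records formatted like the sample above. The function name `parse_monitor_records` and the regular expression are assumptions derived only from that sample, not from a documented output format, so other versions may require adjustments.

```python
import re

# Matches "| <thread>: runs: <n>r/s ... idle: <n>% wait: <n>%".
# DOTALL lets a record span several wrapped lines, as in the sample.
RECORD_RE = re.compile(
    r"\|\s*(?P<thread>\w+):\s*runs:\s*(?P<runs>[\d.]+)r/s"
    r".*?idle:\s*(?P<idle>[\d.]+)%\s*wait:\s*(?P<wait>[\d.]+)%",
    re.DOTALL,
)


def parse_monitor_records(text):
    """Return one dict per thread record found in the monitoring text."""
    records = []
    for m in RECORD_RE.finditer(text):
        records.append({
            "thread": m.group("thread"),
            "runs_per_s": float(m.group("runs")),
            "idle_pct": float(m.group("idle")),
            "wait_pct": float(m.group("wait")),
        })
    return records


# Two records copied from the sample output above.
SAMPLE = """\
10/11/2024 12:54:53 | reader: runs: 261.04r/s (GDI (a:0.00,g:2871.45,m:0.00,d:0.00,c:0.00,t:0.00,p:0.00)/s
OTHER (ql:0))
out: 261.04m/s APT: 0.0007s/m idle: 80.88% wait: 0.01% time: 9.99s
10/11/2024 12:54:53 | listener: runs: 268.65r/s (in (g:268.34 a:0.00 e:0.00 r:0.30)/s
GDI (g:0.00,t:0.00,p:0.00)/s)
out: 0.00m/s APT: 0.0001s/m idle: 98.42% wait: 0.00% time: 9.99s
"""

if __name__ == "__main__":
    for rec in parse_monitor_records(SAMPLE):
        print(rec["thread"], rec["runs_per_s"], rec["idle_pct"], rec["wait_pct"])
```

A consistently low idle percentage for the reader threads would be one possible signal that read-only requests are saturating the pool.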
73+
74+
## v9.0.0
75+
76+
### qconf support to add/modify/delete/show complex entries individually
477

578
`Qconf` also allows you to add, modify, delete and display complexes individually using the new `-ace`, `-Ace`,
679
`-mce`, `-Mce`, `-sce` and `-scel` switches. Previously this was only possible as a group command for the whole
780
complex set with `-mc`. More information can be found in the qconf(1) man page or by running `qconf -help`.
881

982
(Available in Open Cluster Scheduler and Gridware Cluster Scheduler)
1083

11-
## Added support to supplementary group IDs in user, operator and manager lists.
84+
### Added support for supplementary group IDs in user, operator and manager lists.
1285

1386
In addition to user and primary group names, it is now possible to specify supplementary group IDs in user, operator,
1487
and manager lists. User lists can be specified in host, queue, configuration, and parallel environment objects to allow
@@ -30,7 +103,7 @@ in a production environment. Enabling caching services like `nscd` can help redu
30103

31104
(Available in Gridware Cluster Scheduler only)
32105

33-
## New internal architecture to support multiple Data Stores
106+
### New internal architecture to support multiple Data Stores
34107

35108
The internal data architecture of `sge_qmaster` has been changed to support multiple data stores. This change
36109
does not have a major impact currently and is not visible to the user. However, it is a prerequisite for future
@@ -40,7 +113,7 @@ enhancing the performance of the cluster in large environments.
40113

41114
(Available in Open Cluster Scheduler and Gridware Cluster Scheduler)
42115

43-
## New RSMAP (Resource Map) complex type
116+
### New RSMAP (Resource Map) complex type
44117

45118
Resource Maps are a new complex type that allows administrators to define a list of special resources which
46119
are available on a host, e.g. GPU devices, networking devices, lists of network ports, or other special resources.
@@ -111,7 +184,7 @@ qrsh -l port_numbers=8 env | grep SGE_HGR
111184
SGE_HGR_port_number=65000 65001 65002 65003 65004 65005 65006 65007
112185
```
113186

114-
## Per HOST complex variables
187+
### Per HOST complex variables
115188

116189
The definition of complex variables contains the attribute `consumable` which could so far have the following values:
117190

@@ -125,7 +198,7 @@ of parallel jobs multiple tasks are running on the same host, the requested amou
125198
once. E.g. multiple tasks of a parallel job can share the same GPU.
126199

127200

128-
## One-line JSON format for accounting and reporting files
201+
### One-line JSON format for accounting and reporting files
129202

130203
The accounting and reporting files contain one line per record.
131204
The format of the records used to be a column-based format with a fixed number of columns,
@@ -202,7 +275,7 @@ as extensions to the accounting and reporting records (e.g. more exact timestamp
202275
additional usage values like maxrss) are only done in the new format.
203276

204277

205-
## Resource and queue requests per scope (global, master, slave) for parallel jobs
278+
### Resource and queue requests per scope (global, master, slave) for parallel jobs
206279

207280
In former product versions resource requests for parallel jobs were only possible on the global level.
208281
Resource requests for parallel jobs were applied to all tasks of the job, both master task (the job script)
