@@ -6,6 +6,26 @@ The format is based on [Keep a Changelog](http://keepachangelog.com/en/1.0.0/)
66and this project adheres to [ Semantic
77Versioning] ( http://semver.org/spec/v2.0.0.html ) .
88
9+ ## [ 0.4.1] - 2023-09-08
10+
11+ We recommend using the following SYCL versions with this release:
12+
13+ - DPC++: 61e51015 or newer
14+ - hipSYCL: d2bd9fc7 or newer
15+
16+ See our [ platform support guide] ( docs/platform-support.md ) for a complete list of all officially supported configurations.
17+
18+ ### Fixed
19+
20+ - Fix the behavior of dry runs (` CELERITY_DRY_RUN_NODES ` ) in the presence of fences or graph horizons (#196 , 069f5029)
21+ - Compatibility with recent hipSYCL >= d2bd9fc7 (#200 , b174df7d)
22+ - Compatibility with recent versions of Intel oneAPI and Arc-series dedicated GPUs (requires deactivating mimalloc, #203 , c1519624)
23+ - Work around a [ bug in DPC++] ( https://github.com/intel/llvm/issues/10982 ) that breaks selection of the non-default device (#210 , 2b652f8)
24+
25+ ### Removed
26+
27+ - Remove outdated workarounds for unsupported SYCL versions (#200 , 85b7479c)
28+
929## [ 0.4.0] - 2023-07-13
1030
1131We recommend using the following SYCL versions with this release:
@@ -15,7 +35,7 @@ We recommend using the following SYCL versions with this release:
1535
1636See our [ platform support guide] ( docs/platform-support.md ) for a complete list of all officially supported configurations.
1737
18- ## Added
38+ ### Added
1939
2040- Introduce new experimental ` host_object ` and ` side_effect ` APIs to express non-buffer dependencies between host tasks (#68 , 7a5326a)
2141- Add new ` CELERITY_GRAPH_PRINT_MAX_VERTS ` config options (#80 , d3dd722)
@@ -31,7 +51,7 @@ See our [platform support guide](docs/platform-support.md) for a complete list o
3151- Introduce ` CELERITY_ACCESSOR_BOUNDARY_CHECK ` CMake option to detect out-of-bounds buffer accesses inside device kernels (enabled by default for debug builds) (#178 , 2c738c8)
3252- Print more helpful error message when buffer allocations exceed available device memory (#179 , 79f97c2)
3353
34- ## Changed
54+ ### Changed
3555
3656- Update spdlog to 1.9.2 (#80 , a178828)
3757- Overhaul logging mechanism (#80 , 1b19bfc)
@@ -50,27 +70,27 @@ See our [platform support guide](docs/platform-support.md) for a complete list o
5070- Drastically improve performance of buffer data location tracking (#184 , adff79e)
5171- Switch to distributed scheduling model (#186 , 0970bff)
5272
53- ## Deprecated
73+ ### Deprecated
5474
5575- Passing ` sycl::device ` to ` distr_queue ` constructor (use a device selector instead) (#113 , 556b6f2)
5676- Capturing buffers and host objects by value into command group functions (capture by reference instead) (#173 , 0a743c7)
5777- ` allow_by_ref ` is no longer required to capture references into command group functions (#173 , 0a743c7)
5878
59- ## Removed
79+ ### Removed
6080
6181- Removed support for ComputeCpp (discontinued) (#167 , 68367dd)
6282- Removed deprecated ` host_memory_layout ` (use ` buffer_allocation_window ` instead) (#187 , f5e6510)
6383- Removed deprecated kernel dimension template parameter on ` one_to_one ` , ` fixed ` and ` all ` range mappers (#187 , 40a12a4)
6484- Kernels can no longer receive ` sycl::item ` (use ` celerity::item ` instead), this was already broken in 0.3.2 (#163 , 67ccacc)
6585
66- ## Fixed
86+ ### Fixed
6787
6888- Improve performance for buffer transfers on IBM Spectrum MPI (#114 , c60527f)
6989- Increase size limit on individual buffer transfer operations from 2 GiB to 128 GiB (#153 , 972682f)
7090- Fix race between creating collective groups and submitting host tasks (#152 , 0a4fca5)
7191- Align read-accessor ` operator[] ` with SYCL 2020 spec by returning const-reference instead of value (#156 , 5011ded)
7292
73- ## Internal
93+ ### Internal
7494
7595- Add microbenchmark suite (#100 , c2853ca, #107 , 51f5bc5)
7696- Update Catch2 to v3.3 (#102 , 9a6f19d, #129 , 0d1e36a, #162 , 5aa33d6)
0 commit comments