
Commit d5905aa

Merge pull request #121 from pythonhealthdatascience/dev
2 parents 187b1b6 + a4a50d8 · commit d5905aa

File tree: 33 files changed, +1977 additions, -237 deletions


.flake8

Lines changed: 2 additions & 2 deletions
```diff
@@ -3,15 +3,15 @@ per-file-ignores =
     docstrings*.py: F811
     length_warmup*.py: F401,F821
     logs*.py: F811
-    mathematical*.py: F401
+    mathematical*.py: F401,F821,E402
     n_reps*.py: E0602,F401,F821,W0611
     outputs*.py: F811
     parallel*.py: F401,F821
     parameters_file*.py: E402,F811,E0102
     parameters_validation*.py: F821
     replications*.py: F401,F811,F821
     scenarios*.py: F401,F821
-    tables_figures*.py: F401,F821
+    tables_figures*.py: F401,F821,E402
     tests*.py: F821
     tests_resources/*.py: E0401
     */outputs_resources/*.py: E261,E262,F821
```

.lintr

Lines changed: 12 additions & 8 deletions
```diff
@@ -5,23 +5,28 @@ exclusions: list(
     unused_import_linter = Inf,
     object_usage_linter = Inf
   ),
+  "pages/experiments/tables_figures.qmd" = list(
+    object_name_linter = 65
+  ),
   "pages/inputs/parameters_validation.qmd" = list(
     object_usage_linter = 771:772
   ),
   "pages/output_analysis/length_warmup.qmd" = list(
-    unused_import_linter = Inf
+    unused_import_linter = Inf,
+    object_usage_linter = Inf
+  ),
+  "pages/output_analysis/length_warmup_resources/metrics.R" = list(
+    object_usage_linter = Inf
   ),
   "pages/output_analysis/n_reps.qmd" = list(
     unused_import_linter = Inf,
     object_usage_linter = Inf
   ),
   "pages/output_analysis/outputs.qmd" = list(
-    one_call_pipe_linter = Inf,
     line_length_linter = 2850
   ),
   "pages/output_analysis/parallel.qmd" = list(
-    object_usage_linter = Inf,
-    one_call_pipe_linter = Inf
+    object_usage_linter = Inf
   ),
   "pages/output_analysis/outputs_resources/model.R" = list(
     object_usage_linter = Inf
@@ -34,10 +39,9 @@ exclusions: list(
   ),
   "pages/style_docs/linting_resources/code.R",
   "pages/verification_validation/mathematical.qmd" = list(
-    unused_import_linter = Inf
-  ),
-  "pages/verification_validation/tests_resources/simulation.R" = list(
-    one_call_pipe_linter = 228
+    unused_import_linter = Inf,
+    object_usage_linter = Inf,
+    library_call_linter = Inf
   ),
   "pages/verification_validation/tests_resources/test_back.R" = list(
     expect_identical_linter = Inf
```

_quarto.yml

Lines changed: 5 additions & 0 deletions
```diff
@@ -23,6 +23,7 @@ website:
         - pages/project/stars.qmd
       - section: "Introduction"
         contents:
+          - pages/intro/des.qmd
           - pages/intro/rap.qmd
           - pages/intro/guidelines.qmd
           - pages/intro/foss.qmd
@@ -77,6 +78,10 @@ website:
           - pages/sharing/citation.qmd
          - pages/sharing/changelog.qmd
          - pages/sharing/archive.qmd
+      - section: "Closing remarks"
+        contents:
+          - pages/further_info/conclusion.qmd
+          - pages/further_info/feedback.qmd
   favicon: images/stars_logo_blue.png
   navbar:
     collapse: false
```

pages/experiments/scenarios.qmd

Lines changed: 14 additions & 0 deletions
```diff
@@ -21,6 +21,9 @@ bibliography: scenarios_resources/references.bib
 **Relevant reproducibility guidelines:**
 
 * STARS Reproducibility Recommendations (⭐): Provide code for all scenarios and sensitivity analyses.
+* STARS Reproducibility Recommendations: Save outputs to a file.
+* STARS Reproducibility Recommendations: Avoid excessive output files.
+* STARS Reproducibility Recommendations: Address large file sizes.
 
 **Pre-reading:**
 
@@ -441,6 +444,17 @@ kable(sensitivity_results) |> scroll_box(height = "400px")
 
 :::
 
+## Saving results
+
+Saving your simulation results to file is important for reproducibility, as it allows others to verify your findings and generate consistent (or new) figures and analyses, even if they can't re-run your simulation.
+This practice is transparent, providing a clear record of what you found. It is also valuable for you, ensuring you always know exactly what results you obtained and can regenerate your own tables and figures from them.
+
+However, there are two key things to keep in mind:
+
+**1. Number of files.** Running many scenarios or replications can easily lead to an explosion of output files. Do not save each scenario or run as a separate file unless there is a specific need. Instead, combine all results into a **single file with columns marking scenario and replication IDs**.
+
+**2. Avoid large file sizes.** Be strategic about what you save. For short tests and debugging, saving detailed ("patient-level") results makes sense. But for full-scale runs with many replications, those files can become unmanageably large. Generally, **save summary outputs** to file for analysis (e.g. means from each run), not massive raw datasets. If you absolutely need to save or share large files, use compressed formats (e.g. `csv.gz`). Also, keep in mind practical size limits for version control: for example, GitHub's individual file size limit is 100 MB.
+
 ## Explore the example models
 
 <div class="h3-tight"></div>
```
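The "single combined file" and "compressed summary outputs" advice added above can be sketched in a few lines of Python. This is a hypothetical illustration, not code from the book's example models: the scenario names, column names, and output file name are invented, and the random draws stand in for real run summaries.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(42)

# Collect one summary row per (scenario, replication) pair, rather than
# writing a separate file for each run. Names here are illustrative only.
frames = []
for scenario in ["base", "extra_doctor"]:
    for rep in range(3):
        frames.append(pd.DataFrame({
            "scenario": [scenario],
            "replication": [rep],
            # Stand-in for a real summary metric from one run
            "mean_wait_time": [rng.uniform(5, 15)],
        }))

# One tidy table with scenario and replication ID columns...
results = pd.concat(frames, ignore_index=True)

# ...saved as a single compressed file (pandas infers gzip from the suffix)
results.to_csv("scenario_results.csv.gz", index=False)
```

The same pattern works in R: bind run summaries into one data frame (e.g. with `dplyr::bind_rows()`) and write it with `readr::write_csv()`, which also infers gzip compression from a `.csv.gz` path.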

pages/experiments/tables_figures.qmd

Lines changed: 43 additions & 23 deletions
````diff
@@ -8,6 +8,7 @@ bibliography: scenarios_resources/references.bib
 ```{python}
 #| echo: false
 # pylint: disable=too-many-locals,undefined-variable,unused-import
+# pylint: disable=pointless-statement
 ```
 
 :::: {.pale-blue}
@@ -165,7 +166,7 @@ def summarise_scenarios(results, groups, result_vars, path_prefix=None):
     result_vars : list
         List of performance measures to get results on (provided as strings).
     path_prefix : str, optional
-        Path prefix to save tables to. Each metric will be saved as 
+        Path prefix to save tables to. Each metric will be saved as
         {path_prefix}_{metric}.csv
 
     Returns
@@ -182,8 +183,10 @@ def summarise_scenarios(results, groups, result_vars, path_prefix=None):
         .apply(pd.Series)
         .reset_index()
     )
-    summary_table.columns = (list(summary_table.columns[:-4]) +
-                             ["mean", "std_dev", "ci_lower", "ci_upper"])
+    summary_table.columns = (
+        list(summary_table.columns[:-4]) +
+        ["mean", "std_dev", "ci_lower", "ci_upper"]
+    )
 
     # Add column to identify which metric this is
     summary_table["metric"] = result_var
@@ -312,7 +315,9 @@ HTML(to_html_datatable(sensitivity_tables["mean_patients_in_system"]))
 #'
 #' @return A named list of summary data frames.
 
-summarise_scenarios <- function(results, groups, result_vars, path_prefix = NULL) {
+summarise_scenarios <- function(
+  results, groups, result_vars, path_prefix = NULL
+) {
   summary_tables <- list()
 
   for (result_var in result_vars) {
@@ -321,8 +326,8 @@ summarise_scenarios <- function(results, groups, result_vars, path_prefix = NULL
     summarise(
       mean = mean(.data[[result_var]], na.rm = TRUE),
       std_dev = sd(.data[[result_var]], na.rm = TRUE),
-      ci_lower = t.test(.data[[result_var]])$conf.int[1],
-      ci_upper = t.test(.data[[result_var]])$conf.int[2],
+      ci_lower = t.test(.data[[result_var]])$conf.int[1L],
+      ci_upper = t.test(.data[[result_var]])$conf.int[2L],
       .groups = "drop"
     ) |>
     mutate(metric = result_var)
@@ -344,7 +349,7 @@ summarise_scenarios <- function(results, groups, result_vars, path_prefix = NULL
 ### Scenario analysis
 
 ```{r}
-result_variables = c(
+result_variables <- c(
   "mean_wait_time_doctor",
   "utilisation_doctor",
   "mean_queue_length_doctor",
@@ -699,49 +704,64 @@ This function is used to plot the results from the scenarios and sensitivity analyses
 ```{r}
 #' Plot multiple performance measures at once
 #'
-#' @param summary_tables Named list of summary tables, one per metric (like output from summarise_scenarios()).
+#' @param summary_tables Named list of summary tables, one per metric
+#' (like output from summarise_scenarios()).
 #' @param x_var Name of variable to plot on x axis.
 #' @param colour_var Name of variable to colour lines with (can be NULL).
 #' @param name_mappings Optional named list for prettier axis/legend labels.
 #' @param path_prefix Optional path prefix to save figures.
 #' @return Named list of ggplot objects
 
 plot_metrics <- function(
-  summary_tables, x_var, colour_var = NULL,
-  name_mappings = NULL, path_prefix = NULL
+  summary_tables, x_var, colour_var = NULL, name_mappings = NULL,
+  path_prefix = NULL
 ) {
+  # List to store plots for each metric
   plot_list <- list()
 
  for (metric_name in names(summary_tables)) {
+    # Extract relevant results table
     summary_table <- summary_tables[[metric_name]]
 
-    y_var <- "mean"
-    ci_lower <- "ci_lower"
-    ci_upper <- "ci_upper"
-
-    xaxis_title <- if (!is.null(name_mappings[[x_var]])) name_mappings[[x_var]] else x_var
-    yaxis_title <- if (!is.null(name_mappings[[metric_name]])) name_mappings[[metric_name]] else metric_name
-    legend_title <- if (!is.null(colour_var) && !is.null(name_mappings[[colour_var]])) name_mappings[[colour_var]] else colour_var
+    # Helper to map a variable to display name if available in `name_mappings`.
+    # Just uses variable name if no mapping is found.
+    get_name <- function(var) {
+      if (!is.null(var) && !is.null(name_mappings[[var]])) {
+        name_mappings[[var]]
+      } else {
+        var
+      }
+    }
+    xaxis_title <- get_name(x_var)
+    yaxis_title <- get_name(metric_name)
+    legend_title <- get_name(colour_var)
 
+    # Create plot, with or without grouping colour variable
     if (!is.null(colour_var)) {
       summary_table[[colour_var]] <- as.factor(summary_table[[colour_var]])
-      p <- ggplot(summary_table, aes_string(x = x_var, y = y_var, group = colour_var, color = colour_var, fill = colour_var)) +
+      p <- ggplot(summary_table,
+                  aes_string(x = x_var, y = "mean", group = colour_var,
+                             color = colour_var, fill = colour_var)) +
         geom_line() +
-        geom_ribbon(aes_string(ymin = ci_lower, ymax = ci_upper), alpha = 0.1) +
-        labs(x = xaxis_title, y = yaxis_title, color = legend_title, fill = legend_title) +
+        geom_ribbon(aes_string(ymin = "ci_lower", ymax = "ci_upper"),
+                    alpha = 0.1) +
+        labs(x = xaxis_title, y = yaxis_title, color = legend_title,
+             fill = legend_title) +
         theme_minimal()
     } else {
-      p <- ggplot(summary_table, aes_string(x = x_var, y = y_var)) +
+      p <- ggplot(summary_table, aes_string(x = x_var, y = "mean")) +
         geom_line() +
-        geom_ribbon(aes_string(ymin = ci_lower, ymax = ci_upper), alpha = 0.1, show.legend = FALSE) +
+        geom_ribbon(aes_string(ymin = "ci_lower", ymax = "ci_upper"),
+                    alpha = 0.1, show.legend = FALSE) +
         labs(x = xaxis_title, y = yaxis_title) +
         theme_minimal()
     }
 
     # Save plot if prefix supplied
     if (!is.null(path_prefix)) {
       output_path <- paste0(path_prefix, "_", metric_name, ".png")
-      ggsave(filename = output_path, plot = p, width = 6.5, height = 4, bg = "white")
+      ggsave(filename = output_path, plot = p, width = 6.5, height = 4L,
+             bg = "white")
     }
 
     plot_list[[metric_name]] <- p
````

pages/further_info/conclusion.qmd

Lines changed: 127 additions & 0 deletions
```diff
@@ -0,0 +1,127 @@
+---
+title: Conclusion
+---
+
+<!-- Hide as no python-content or r-content blocks -->
+<style>
+#quarto-announcement {
+  display: none !important;
+}
+</style>
+
+<br>
+
+::: {.pale-blue}
+
+**Well done, you made it to the end!** 😁
+
+This book has shared with you the knowledge and tools to create simulation models in Python or R as part of a reproducible analytical pipeline - models that others can reproduce, trust, understand, and build upon.
+
+:::
+
+## What's next?
+
+<div class="h3-tight"></div>
+
+### Explore the example models
+
+These example models demonstrate many of the practices covered in this book. They may seem daunting at first - there's a lot going on - but having worked through the book, you're in a good position to understand them.
+
+Remember, these are **examples**, not prescriptions. They're not perfect, and there's no single "right way" to build reproducible models. They simply show one approach to implementing the principles you've learned.
+
+**Nurse visit simulation:**
+
+{{< include ../../html/pydesrapmms.html >}}
+
+{{< include ../../html/rdesrapmms.html >}}
+
+**Stroke pathway simulation:**
+
+{{< include ../../html/pydesrapstroke.html >}}
+
+{{< include ../../html/rdesrapstroke.html >}}
+
+### Make your own model
+
+The best way to solidify what you've learned is to apply it. When planning your model, remember that a good simulation starts with **conceptual modelling**. As defined in Robinson (2007):
+
+> "The conceptual model is a non-software specific description of the simulation model that is to be developed, describing the objectives, inputs, outputs, content, assumptions and simplifications of the model."
+
+Some good resources on conceptual modelling include:
+
+* Robinson, Stewart. 2007. "Chapter 5: Conceptual Modelling." In Simulation: The Practice of Model Development and Use, 63–75. John Wiley & Sons.
+* Robinson, Stewart. 2007. "Chapter 6: Developing the Conceptual Model." In Simulation: The Practice of Model Development and Use, 77–93. John Wiley & Sons.
+
+This book focused on building simple model structures to help you establish robust foundations and reproducible workflows. However, real-world simulation models often involve additional features and complexities, such as reneging, balking, priority classes, resource scheduling, branching, blocking, or more detailed patient pathways.
+
+For inspiration on implementing a wider range of features, the [HSMA "little book of DES"](https://des.hsma.co.uk/) is an excellent resource. Its examples use Python, and the setup or structure may differ from those in this book, but the simulation principles apply whatever language you use. Focus on understanding the logic - how features are implemented and why - then adapt those ideas for the language, structure, or workflow that best fits your own model.
+
+### Review your existing models
+
+Already have simulation models in development or completed? Now's a great time to audit them against the practices you've learned.
+
+Use the checklists linked below to identify what you've already achieved (woohoo!) and what's missing. Then revisit relevant sections of the book to fill the gaps. Even small improvements - adding seeds, externalising parameters, or documenting dependencies - can significantly enhance your model's reproducibility.
+
+## Checklists
+
+Download checklists to **audit existing models** or **guide development of new models**.
+
+You can see examples of completed checklists in the nurse visit and stroke simulation example model repositories.
+
+{{< downloadthis conclusion_resources/stars_reproducibility_recommendations.md dname="stars_reproducibility_recommendations" label="Download the STARS reproducibility recommendations" type="primary" >}}
+
+{{< downloadthis conclusion_resources/nhs_levels_of_rap.md dname="nhs_levels_of_rap" label="Download the NHS Levels of RAP maturity framework" type="primary" >}}
+
+Also, don't forget about the handy **verification and validation** checklist:
+
+{{< downloadthis ../verification_validation/verification_validation_resources/verification_validation_checklist.md dname="verification_validation_checklist" label="Download the verification and validation checklist" type="primary" >}}
+
+## Acknowledgements
+
+This book builds upon the generous work of many contributors to the open-source and simulation communities. We are particularly grateful for:
+
+* The **SimPy** and **simmer** development teams for creating and maintaining excellent open-source DES libraries.
+* The **NHS RAP Community of Practice** for their maturity framework.
+* The **HSMA Programme** (Health Service Modelling Associates) for their [little book of DES](https://des.hsma.co.uk/).
+* All the **researchers and practitioners who have openly shared their simulation models**, enabling the research that informed this book.
+* **Contributors and reviewers** who have provided feedback to improve this resource.
+
+Full references and citations appear throughout the book where specific resources are discussed.
+
+### Please cite this book
+
+The code in this book is licensed under an **MIT License**, and the text is under **CC-BY-SA**, making it free to use, modify and share. However, we kindly ask/require that you **cite or acknowledge** this work when you use it.
+
+Suggested citation:
+
+> Heather, A., Monks, T., Mustafee, N., & Harper, A. (2025). DES RAP Book: Reproducible Discrete-Event Simulation in Python and R. https://github.com/pythonhealthdatascience/des_rap_book. https://doi.org/10.5281/zenodo.17094155.
+
+## Find out more about STARS
+
+This book is part of the **STARS (Sharing Tools and Artefacts for Reusable and Reproducible Simulations)** project, supported by the Medical Research Council [grant number MR/Z503915/1].
+
+![](../../images/stars_banner.png)
+
+STARS tackles the challenges of sharing, reusing, and reproducing discrete-event simulation (DES) models in healthcare. Our goal is to create open resources using the two most popular open-source languages for DES: Python and R. As part of this project, you'll find tutorials, code examples, and tools to help researchers and practitioners develop, validate, and share DES models more effectively.
+
+Learn more:
+
+* **GitHub organisation:** <https://github.com/pythonhealthdatascience>
+
+<!-- TODO: Add link to STARS summary website once created -->
+
+## Well done! It's a journey, not a race
+
+Whether you're new to reproducible simulation or deepening your existing practice, you've taken important steps toward building more trustworthy, transparent models.
+
+Implementing these practices requires time and iteration. **Perfection is not required immediately - or ever**. Many of us are crunched for time, juggling multiple projects and deadlines. Finding space to improve workflows can feel impossible.
+
+When time is limited, focus on what matters most for your specific project. Go through the checklists or flip back through the book to identify key steps that would make the biggest difference. **Every small change moves your work forward.**
+
+**Don't be afraid to share your model**, even if it doesn't have all the bells and whistles. You benefit from others' shared work; others benefit from yours. Shared models spark conversations, enable collaboration, and push the field forward. Each model shared raises the bar for transparency and makes it easier for the next person to do the same.
+
+<br>
+
+*See the next page for details on giving feedback and contributing to this resource.*
+
+<br><br>
```
