Add Quick search support for ensemble composing model parameter ranges #996

the-david-oy · 2025-12-03T20:14:09Z

Summary

Enables Quick search mode to optimize ensemble and BLS models with composing models that have different resource requirements (e.g., CPU tokenizers with high instance counts alongside GPU models with limited instances).

Previously, Quick search mode rejected any model with parameter ranges, preventing users from optimizing composing models independently. This implementation allows composing models to specify instance_group count ranges while maintaining the restriction for top-level models.

Changes

Config Validation (`config_command.py`)

Add _is_composing_model() helper to identify BLS and CPU-only composing models
Update _check_quick_search_model_config_parameters_combinations() to allow composing models to have parameter ranges
Update _check_per_model_model_config_parameters() to permit max_batch_size and instance_group ranges for composing models
Maintain existing restrictions for top-level models

SearchDimensions Construction (`run_config_generator_factory.py`)

Add _get_instance_count_list() to extract user-specified count lists from model configs
Add _create_instance_dimension_from_list() to create constrained SearchDimensions
Support two sequence types:
- Powers of 2: [1, 2, 4, 8, 16, 32] → EXPONENTIAL dimensions
- Contiguous sequences: [1, 2, 3, 4, 5] → LINEAR dimensions
Add _is_powers_of_two() and _is_linear_sequence() validators
Update _get_dimensions_for_model() to use user-specified lists when available

Coordinate Mapping (`quick_run_config_generator.py`)

Add _extract_instance_group_kind() to preserve user-specified KIND (CPU/GPU)
Update _get_next_model_config_variant() to extract kind before removing searchable parameters
Remove restrictive assertion that blocked composing models with multiple parameter combinations
Maintain single-combination requirement after removing searchable parameters

Documentation (`docs/config_search.md`)

Add "Ensemble Composing Model Parameter Ranges" subsection under Quick Search Mode
Provide complete YAML example with CPU tokenizer and GPU inference model
Document supported patterns and limitations
Add cross-references in Ensemble and BLS sections

Testing

Tests cover:

Valid patterns (powers of 2, contiguous sequences)
Invalid patterns with helpful error messages
Mixed scenarios (ranged + fixed parameters)
Edge cases (empty lists, single values, nested structures)

Example Configuration

model_repository: /path/to/model/repository/
run_config_search_mode: quick

cpu_only_composing_models:
  - tokenizer

profile_models:
  tokenizer:
    model_config_parameters:
      instance_group:
        - kind: KIND_CPU
          count: [1, 2, 4, 8, 16, 32]  # Search CPU instances
      dynamic_batching:
        max_queue_delay_microseconds: [0]

  inference_model:
    model_config_parameters:
      instance_group:
        - kind: KIND_GPU
          count: [1, 2, 4, 8]  # Search GPU instances
      dynamic_batching:
        max_queue_delay_microseconds: [0]

  ensemble_model:
    model_config_parameters:
      dynamic_batching:
        max_queue_delay_microseconds: [0]

## Impact

Enables optimization of ensemble models with heterogeneous composing models (e.g., CPU tokenizers + GPU inference).

tests/test_config_composing_model_validation.py

tests/test_ensemble_composing_model_integration.py

Copilot

Pull request overview

This PR enables Quick search mode to optimize ensemble and BLS models where composing models have different resource requirements (e.g., CPU tokenizers with high instance counts alongside GPU models with limited instances). Previously, Quick search mode rejected any model with parameter ranges.

Key changes:

Allow composing models to specify instance_group count ranges in Quick mode
Add validation logic to distinguish between top-level and composing models
Support powers-of-2 and contiguous sequence patterns for instance counts
Preserve user-specified instance_group KIND (CPU/GPU) during coordinate mapping

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
`tests/test_search_dimensions_factory.py`	New test suite validating SearchDimension creation from user-specified instance count lists
`tests/test_quick_run_config_generator.py`	Updated copyright header and test expectation for dynamic batching configuration
`tests/test_quick_coordinate_mapping.py`	New test suite for instance_group KIND extraction helper
`tests/test_ensemble_composing_model_integration.py`	Integration tests for ensemble models with mixed CPU/GPU composing models
`tests/test_config_composing_model_validation.py`	Validation tests ensuring composing models can use ranges while top-level models cannot
`model_analyzer/config/input/config_command.py`	Updated validation logic to permit parameter ranges for composing models
`model_analyzer/config/generate/run_config_generator_factory.py`	Added methods to create SearchDimensions from user-specified count lists
`model_analyzer/config/generate/quick_run_config_generator.py`	Enhanced coordinate mapping to preserve instance_group KIND and handle composing models
`docs/config_search.md`	Documentation for ensemble composing model parameter ranges feature

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

model_analyzer/config/generate/run_config_generator_factory.py

model_analyzer/config/generate/model_profile_spec.py

Fix copyrights Fix tests Update tests Allow user to specify composing models instead of relying on auto-discovery Warn when there is a non-existent composing model. Update copyrights Update copyrights Fix model name YAML Correctly get kind Properly set CPU/GPU kind Address Copilot feedback Address Copilot feedback Fix regex for CI, add config in test

github-advanced-security bot found potential problems Dec 3, 2025

View reviewed changes

the-david-oy force-pushed the doy-ensemble branch from b5c3607 to e821212 Compare December 3, 2025 20:16

the-david-oy marked this pull request as draft December 3, 2025 20:17

the-david-oy force-pushed the doy-ensemble branch from e821212 to 13abdd3 Compare December 3, 2025 22:00

the-david-oy requested a review from Copilot December 3, 2025 22:41

Copilot AI reviewed Dec 3, 2025

View reviewed changes

model_analyzer/config/generate/run_config_generator_factory.py Outdated Show resolved Hide resolved

model_analyzer/config/generate/run_config_generator_factory.py Outdated Show resolved Hide resolved

the-david-oy force-pushed the doy-ensemble branch 5 times, most recently from 7850677 to 275e9fb Compare December 5, 2025 00:09

github-advanced-security bot found potential problems Dec 5, 2025

View reviewed changes

model_analyzer/config/generate/model_profile_spec.py Fixed Show fixed Hide fixed

model_analyzer/config/generate/model_profile_spec.py Fixed Show fixed Hide fixed

the-david-oy force-pushed the doy-ensemble branch 2 times, most recently from 209e349 to db7fa66 Compare December 5, 2025 23:24

github-advanced-security bot found potential problems Dec 5, 2025

View reviewed changes

model_analyzer/config/generate/model_profile_spec.py Fixed Show fixed Hide fixed

the-david-oy force-pushed the doy-ensemble branch from db7fa66 to c0b1994 Compare December 5, 2025 23:35

the-david-oy force-pushed the doy-ensemble branch from c0b1994 to d450313 Compare December 6, 2025 02:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Quick search support for ensemble composing model parameter ranges #996

Add Quick search support for ensemble composing model parameter ranges #996

Uh oh!

the-david-oy commented Dec 3, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

2 participants

Add Quick search support for ensemble composing model parameter ranges #996

Are you sure you want to change the base?

Add Quick search support for ensemble composing model parameter ranges #996

Uh oh!

Conversation

the-david-oy commented Dec 3, 2025

Summary

Changes

Config Validation (config_command.py)

SearchDimensions Construction (run_config_generator_factory.py)

Coordinate Mapping (quick_run_config_generator.py)

Documentation (docs/config_search.md)

Testing

Example Configuration

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

2 participants

Config Validation (`config_command.py`)

SearchDimensions Construction (`run_config_generator_factory.py`)

Coordinate Mapping (`quick_run_config_generator.py`)

Documentation (`docs/config_search.md`)