support multiple device evaluation for activation quantized model #1394
base: main
Conversation
Pull request overview
This PR adds new device-management functionality to support multi-device evaluation of activation-quantized models and comments out multiple test cases in the scheme tests.
Changes:
- Commented out extensive test cases in the scheme test file to focus on a single `test_set_scheme` test
- Added a new `dispatch_model_block_wise` utility function for multi-device model dispatching (a hedged sketch follows this list)
- Updated evaluation code to use the new device dispatch mechanism
- Changed a logging level from warning to info and added `tie_weights()` calls in multiple locations
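As a rough illustration of what a block-wise multi-device dispatch helper can look like, here is a minimal sketch built on accelerate's `get_balanced_memory`, `infer_auto_device_map`, and `dispatch_model`. The function body, the `max_mem_ratio` default, and the use of `model._no_split_modules` are assumptions for illustration, not the PR's actual `dispatch_model_block_wise` implementation:

```python
from accelerate import dispatch_model, infer_auto_device_map
from accelerate.utils import get_balanced_memory


def dispatch_model_block_wise(model, max_mem_ratio=0.9):
    """Spread a model block-wise across the available accelerators.

    Sketch only; the real helper in auto_round/utils/device.py may differ.
    """
    # Balance the per-device budgets, then keep some headroom for activations,
    # temporary tensors, and allocator fragmentation to reduce OOM risk.
    max_memory = get_balanced_memory(
        model, no_split_module_classes=model._no_split_modules
    )
    max_memory = {dev: int(mem * max_mem_ratio) for dev, mem in max_memory.items()}
    device_map = infer_auto_device_map(
        model, max_memory=max_memory, no_split_module_classes=model._no_split_modules
    )
    return dispatch_model(model, device_map=device_map)
```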
Reviewed changes
Copilot reviewed 5 out of 5 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| test/test_cpu/schemes/test_scheme.py | Commented out multiple test cases, leaving only test_set_scheme active |
| auto_round/utils/device.py | Added new dispatch_model_block_wise function for block-wise model dispatching across devices |
| auto_round/eval/evaluation.py | Updated prepare_model_for_eval to use new device dispatch utility and changed parameter name |
| auto_round/compressors/base.py | Changed logger level from warning to info, added tie_weights() call, and modified error raising |
| auto_round/auto_scheme/utils.py | Added tie_weights() call before device map inference |
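As context for the `tie_weights()` additions above: calling it before device-map inference makes shared parameters (for example, input embeddings tied to the LM head) actually point to the same storage, so the mapping does not split them across devices. A minimal sketch, assuming a Hugging Face causal LM; the model name is only an example:

```python
from accelerate import infer_auto_device_map
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")
model.tie_weights()  # ensure tied parameters share storage before mapping
device_map = infer_auto_device_map(
    model, no_split_module_classes=model._no_split_modules
)
```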
    # temporary tensors, other processes, and allocator fragmentation, reducing
    # the chance of runtime OOM while still utilizing most available memory.
    new_max_memory[device] = max_memory[device] * max_mem_ratio
    new_max_memory = get_balanced_memory(
If I remember correctly, this will use all of CUDA_VISIBLE_DEVICES. Setting CUDA_VISIBLE_DEVICES based on the device_map information might help.
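One way to realize that suggestion, shown only as a hedged sketch: the device indices, memory limits, and model name below are placeholders, not values from this PR.

```python
import os

# Option 1: limit visibility before anything initializes CUDA in this process.
os.environ.setdefault("CUDA_VISIBLE_DEVICES", "0,1")

from accelerate.utils import get_balanced_memory
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("facebook/opt-125m")
model.tie_weights()

# Option 2: pass budgets only for the devices the device_map should use,
# so the balancing step never considers the other visible GPUs.
max_memory = {0: "20GiB", 1: "20GiB", "cpu": "64GiB"}
balanced_memory = get_balanced_memory(model, max_memory=max_memory)
```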
Description
Please briefly describe your main changes and the motivation behind them.
Type of Change
Related Issues
Fixes or relates to #
Checklist Before Submitting