Fix unsafe deserialization in ETRecord and export_serialize by lucylq · Pull Request #18133 · pytorch/executorch

lucylq · 2026-03-12T17:35:14Z

Add weights_only=True to torch.load() in deserialize_torch_artifact()
to prevent arbitrary code execution via malicious serialized artifacts.

Replace usages of pickle.dumps and pickle.loads with torch.save and torch.load

As generating and reading etrecord should be done as debugging steps and usually on the same version of et, BC is not a big concern.

This PR was authored with the assistance of Claude.

pytorch-bot · 2026-03-12T17:35:20Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18133

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 Awaiting Approval, 1 New Failure, 3 Pending, 9 Unrelated Failures

As of commit 1bf0180 with merge base eaf0d65 ():

AWAITING APPROVAL - The following workflow needs approval before CI can run:

Claude Code (gh)

NEW FAILURE - The following job has failed:

Build Presets / windows (windows) / build (gh)

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

pull / unittest / windows / windows-job (gh) (similar failure)
The process 'C:\Program Files\Git\cmd\git.exe' failed with exit code 1
Test CUDA Builds / test-models-cuda (mv2) / linux-job (gh) (matched linux rule in flaky-rules.json)
The runner has received a shutdown signal. This can happen when the runner service is stopped, or a manually started runner is canceled.
Test CUDA Windows Export and E2E / test-model-cuda-windows-e2e (mistralai, Voxtral-Mini-4B-Realtime-2602, quantized-int4-tile-packed) / windows-job (gh) (similar failure)
The process 'C:\Program Files\Git\cmd\git.exe' failed with exit code 1
Test CUDA Windows Export and E2E / test-model-cuda-windows-e2e (nvidia, diar_streaming_sortformer_4spk-v2, non-quantized) / windows-job (gh) (similar failure)
The process 'C:\Program Files\Git\cmd\git.exe' failed with exit code 1
Test CUDA Windows Export and E2E / test-model-cuda-windows-e2e (nvidia, parakeet-tdt, non-quantized) / windows-job (gh) (similar failure)
The process 'C:\Program Files\Git\cmd\git.exe' failed with exit code 1

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

Build Presets / windows (pybind) / build (gh) (trunk failure)
The process 'C:\Program Files\Git\cmd\git.exe' failed with exit code 1
pull / unittest-editable / windows / windows-job (gh) (trunk failure)
The process 'C:\Program Files\Git\cmd\git.exe' failed with exit code 1
Test CUDA Windows Export and E2E / test-model-cuda-windows-e2e (mistralai, Voxtral-Mini-3B-2507, non-quantized) / windows-job (gh) (trunk failure)
The process 'C:\Program Files\Git\cmd\git.exe' failed with exit code 1
Test CUDA Windows Export and E2E / test-model-cuda-windows-e2e (nvidia, parakeet-tdt, quantized-int4-weight-only) / windows-job (gh) (trunk failure)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-03-12T17:36:19Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Add weights_only=True to torch.load() in deserialize_torch_artifact() to prevent arbitrary code execution via malicious serialized artifacts. Add security warnings to parse_etrecord() docstring and inline comments for the pickle.loads() calls on reference_outputs and representative_inputs, which remain unsafe but cannot be replaced without breaking the serialization format. Addresses TOB-EXECUTORCH-11. This PR was authored with the assistance of Claude.

Switch reference_outputs and representative_inputs serialization from pickle.dumps/loads to torch.save/torch.load(weights_only=True). The data types (Dict[str, List[List[Tensor]]] and List[Union[Tensor, int, float, bool]]) are all in the weights_only allowlist. ETRecord files are transient dev artifacts with a single writer and reader, so no backward-compatibility fallback is needed. Old ETRecord files can be regenerated from source. Addresses TOB-EXECUTORCH-11. This PR was authored with the assistance of Claude.

Copilot

Pull request overview

Hardens serialization/deserialization paths by switching away from raw pickle usage and enabling safer torch.load(..., weights_only=True) to mitigate arbitrary code execution risks from untrusted artifacts.

Changes:

Use torch.load(..., weights_only=True) in deserialize_torch_artifact.
Replace pickle.dumps/loads for ETRecord metadata fields with torch.save/load using in-memory buffers.
Remove pickle import and add io usage for byte buffering.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 4 comments.

File	Description
exir/serde/export_serialize.py	Enables `weights_only=True` for safer torch artifact deserialization.
devtools/etrecord/_etrecord.py	Replaces pickle-based storage/loading of ETRecord metadata with torch serialization and safer loading.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

You can also share your feedback on Copilot code review. Take the survey.

Copilot · 2026-03-12T18:59:49Z

devtools/etrecord/_etrecord.py

+            reference_outputs = torch.load(
+                io.BytesIO(
+                    etrecord_zip.read(ETRecordReservedFileNames.REFERENCE_OUTPUTS)
+                ),
+                weights_only=True,
            )
        elif entry == ETRecordReservedFileNames.REPRESENTATIVE_INPUTS:
-            # @lint-ignore PYTHONPICKLEISBAD
-            representative_inputs = pickle.loads(
-                etrecord_zip.read(ETRecordReservedFileNames.REPRESENTATIVE_INPUTS)
+            representative_inputs = torch.load(
+                io.BytesIO(
+                    etrecord_zip.read(ETRecordReservedFileNames.REPRESENTATIVE_INPUTS)
+                ),
+                weights_only=True,
            )


This change makes parse_etrecord unable to read existing ETRecords that were written with pickle.dumps for these entries (the on-disk format changed). Consider adding a backward-compatible read path: attempt torch.load(..., weights_only=True) first, and if it fails with a known/expected exception, fall back to legacy pickle.loads only under an explicit opt-in (e.g., a trust_legacy_pickle: bool param or env var) plus a clear warning/error. Without this, previously-generated ETRecords will fail to load.

Copilot · 2026-03-12T18:59:50Z

exir/serde/export_serialize.py

    buffer = io.BytesIO(serialized)
    buffer.seek(0)
-    artifact = torch.load(buffer)
+    artifact = torch.load(buffer, weights_only=True)


weights_only is not supported in some older PyTorch versions. If this repository supports a wider PyTorch range, this will raise TypeError: load() got an unexpected keyword argument 'weights_only'. Consider a small compatibility shim (e.g., feature-detect via inspect.signature(torch.load) or try/except TypeError) so the code fails gracefully with an actionable message (or uses a safe alternative) when weights_only is unavailable.

Suggested change

artifact = torch.load(buffer, weights_only=True)

# `weights_only` is not supported in some older PyTorch versions.

# Feature-detect support and fall back gracefully if unavailable.

try:

load_sig = inspect.signature(torch.load)

if "weights_only" in load_sig.parameters:

artifact = torch.load(buffer, weights_only=True)

else:

logging.warning(

"torch.load does not support the 'weights_only' argument in this "

"PyTorch version; falling back to torch.load(buffer) without it."

)

artifact = torch.load(buffer)

except (TypeError, ValueError):

# As an extra safeguard, fall back to calling without `weights_only`

logging.warning(

"Unable to inspect torch.load signature; falling back to "

"torch.load(buffer) without 'weights_only'."

)

artifact = torch.load(buffer)

Copilot · 2026-03-12T18:59:50Z

devtools/etrecord/_etrecord.py

+            buf = io.BytesIO()
+            torch.save(self._reference_outputs, buf)
            etrecord_zip.writestr(
                ETRecordReservedFileNames.REFERENCE_OUTPUTS,
-                pickle.dumps(self._reference_outputs),
+                buf.getvalue(),
            )

        if self._representative_inputs is not None:
+            buf = io.BytesIO()
+            torch.save(self._representative_inputs, buf)
            etrecord_zip.writestr(
                ETRecordReservedFileNames.REPRESENTATIVE_INPUTS,
-                pickle.dumps(self._representative_inputs),
+                buf.getvalue(),
            )


torch.save uses a zip-based format by default; embedding those bytes inside another ZipFile can noticeably increase size and CPU overhead (zip-inside-zip). If these fields are large, consider using torch.save(..., _use_new_zipfile_serialization=False) to avoid nested zip containers, or an alternative serialization for purely-tensor content to reduce overhead.

Copilot · 2026-03-12T18:59:50Z

devtools/etrecord/_etrecord.py

+            reference_outputs = torch.load(
+                io.BytesIO(
+                    etrecord_zip.read(ETRecordReservedFileNames.REFERENCE_OUTPUTS)
+                ),
+                weights_only=True,
            )


Consider forcing map_location='cpu' when loading from ETRecords. If tensors were saved on CUDA, torch.load can require GPU availability or attempt GPU allocations during parsing, which is undesirable for a devtool reader and can break in CPU-only environments.

Gasoonjia · 2026-03-12T21:55:18Z

can you trigger import to pay attention to the internal ci?

meta-codesync · 2026-03-12T23:19:13Z

@lucylq has imported this pull request. If you are a Meta employee, you can view this in D96389611.

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 12, 2026

lucylq force-pushed the security11 branch from e9884b2 to 7e82c82 Compare March 12, 2026 17:39

lucylq mentioned this pull request Mar 12, 2026

Security #17738

Open

lucylq force-pushed the security11 branch from 304a354 to e328980 Compare March 12, 2026 18:47

lucylq force-pushed the security11 branch from e328980 to 1bf0180 Compare March 12, 2026 18:53

lucylq marked this pull request as ready for review March 12, 2026 18:54

lucylq requested review from Gasoonjia, JacobSzwejbka and larryliu0820 as code owners March 12, 2026 18:54

Copilot AI review requested due to automatic review settings March 12, 2026 18:54

Copilot AI reviewed Mar 12, 2026

View reviewed changes

Copilot started reviewing on behalf of lucylq March 12, 2026 19:03 View session

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix unsafe deserialization in ETRecord and export_serialize#18133

Fix unsafe deserialization in ETRecord and export_serialize#18133
lucylq wants to merge 2 commits intomainfrom
security11

lucylq commented Mar 12, 2026 •

edited

Loading

Uh oh!

pytorch-bot bot commented Mar 12, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Mar 12, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Mar 12, 2026

Uh oh!

Copilot AI Mar 12, 2026

Uh oh!

Copilot AI Mar 12, 2026

Uh oh!

Copilot AI Mar 12, 2026

Uh oh!

Gasoonjia commented Mar 12, 2026

Uh oh!

meta-codesync bot commented Mar 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

-    artifact = torch.load(buffer, weights_only=True)
+    # `weights_only` is not supported in some older PyTorch versions.
+    # Feature-detect support and fall back gracefully if unavailable.
+    try:
+        load_sig = inspect.signature(torch.load)
+        if "weights_only" in load_sig.parameters:
+            artifact = torch.load(buffer, weights_only=True)
+        else:
+            logging.warning(
+                "torch.load does not support the 'weights_only' argument in this "
+                "PyTorch version; falling back to torch.load(buffer) without it."
+            )
+            artifact = torch.load(buffer)
+    except (TypeError, ValueError):
+        # As an extra safeguard, fall back to calling without `weights_only`
+        logging.warning(
+            "Unable to inspect torch.load signature; falling back to "
+            "torch.load(buffer) without 'weights_only'."
+        )
+        artifact = torch.load(buffer)

Conversation

lucylq commented Mar 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Mar 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18133

❌ 1 Awaiting Approval, 1 New Failure, 3 Pending, 9 Unrelated Failures

Uh oh!

github-actions bot commented Mar 12, 2026

This PR needs a release notes: label

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

Gasoonjia commented Mar 12, 2026

Uh oh!

meta-codesync bot commented Mar 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

lucylq commented Mar 12, 2026 •

edited

Loading

pytorch-bot bot commented Mar 12, 2026 •

edited

Loading

This PR needs a `release notes:` label