IBMQExperiment and FreeformDesign/CombinedExperimentDesign quality-of-life updates #379

sserita · 2023-12-02T06:35:42Z

This PR makes several quality-of-life improvements for IBMQExperiment, including checkpointing for IBMQ experiments mentioned in #327, as well as FreeformDesign and CombinedExperimentDesign serialization.

IBMQExperiment updates:

The IBMQExperiment class is changed from a dict to a full class that inherits from TreeNode, making it commensurate with serialization of objects such as ExperimentDesign and ProtocolData.
With a more granular serialization strategy, it is now possible to checkpoint IBMQExperiment objects. The API follows GST checkpointing: checkpoint_path and disable_checkpointing can be passed to the constructor (checkpointing is on by default). If checkpoint_path is provided, then transpile(), submit(), and retrieve_results() will update the on-disk ibmqexperiment when possible.
If bson is installed, we can use the json_util hook to handle objects such as datetime.datetime, which are present in submit_time_calibration_data and batch_results from the IBMQ server. When bson is available, everything is serialized to text/JSON files; otherwise we still fall back to pickle.
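The bson fallback described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: write_aux and read_aux are hypothetical helper names, and the only library calls assumed are json_util.default and json_util.object_hook from the bson package that ships with PyMongo.

```python
import datetime
import json
import pickle
import tempfile
from pathlib import Path

try:
    from bson import json_util  # optional dependency (ships with PyMongo)
except ImportError:
    json_util = None


def write_aux(obj, stem):
    """Write obj as JSON when bson is importable, otherwise as pickle."""
    stem = Path(stem)
    if json_util is not None:
        path = stem.with_suffix(".json")
        # json_util.default knows how to encode datetime.datetime and similar types.
        path.write_text(json.dumps(obj, default=json_util.default))
    else:
        path = stem.with_suffix(".pkl")
        path.write_bytes(pickle.dumps(obj))
    return path


def read_aux(path):
    """Read back whichever format write_aux produced."""
    path = Path(path)
    if path.suffix == ".json":
        return json.loads(path.read_text(), object_hook=json_util.object_hook)
    return pickle.loads(path.read_bytes())


# Hypothetical calibration record containing a datetime, as in the IBMQ data.
record = {"submit_time": datetime.datetime(2023, 12, 2, 6, 35, 42)}
written = write_aux(record, Path(tempfile.mkdtemp()) / "calibration")
restored = read_aux(written)
```

Depending on the installed bson version, the restored datetime may come back timezone-aware rather than naive, so a strict equality check against the original is not guaranteed.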

FreeformDesign updates:

The aux_info member of FreeformDesign is written as JSON instead of pickle. To do this, first we cast the Circuit keys as strings and then write the now-JSONable dictionary to file. We convert back to Circuits on deserialization.
More controversially, I skip serializing all_circuits_needing_data for FreeformDesign. This field should match the keys of aux_info, and with the size of these designs, it saves quite a lot of space to not double save them (almost 1 GB for some of my SVB use cases!).
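A rough sketch of the key-casting idea, assuming a circuit type that is hashable and has a lossless string form. ToyCircuit, dump_aux_info and load_aux_info are hypothetical stand-ins for illustration; they are not the pyGSTi Circuit class or the FreeformDesign implementation.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class ToyCircuit:
    """Hypothetical stand-in for a circuit: hashable, with a string form."""
    layers: tuple

    def to_str(self) -> str:
        return ";".join(self.layers)

    @classmethod
    def from_str(cls, text: str) -> "ToyCircuit":
        return cls(tuple(text.split(";")))


def dump_aux_info(aux_info: dict) -> str:
    # JSON object keys must be strings, so cast each circuit key first.
    return json.dumps({circ.to_str(): info for circ, info in aux_info.items()})


def load_aux_info(text: str):
    # Convert the string keys back into circuits on deserialization.
    aux_info = {ToyCircuit.from_str(k): v for k, v in json.loads(text).items()}
    # The circuit list is not stored separately; rebuild it from the keys.
    all_circuits_needing_data = list(aux_info.keys())
    return aux_info, all_circuits_needing_data


aux = {
    ToyCircuit(("Gx:Q0", "Gy:Q0")): {"depth": 2},
    ToyCircuit(("Gx:Q0",)): {"depth": 1},
}
restored_aux, restored_circuits = load_aux_info(dump_aux_info(aux))
```

Because the circuit list is rebuilt from the keys, this only round-trips correctly while the two stay in sync, which is the concern raised later in the review.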

CombinedExperimentDesign updates:

The constructor now takes an additional skip_writing_all_circuits flag. The intention is that in cases where all_circuits_needing_data is simply the union of subdesigns, this can be regenerated on the fly. This saves space on disk and cuts the write/read times for the largest thing being written (cutting my 1 GB down to ~500 MB for my SVB use cases!).

I've done my best to make it backwards-compatible; for example, we can still load old pickle-style IBMQExperiment directories into the new object. However, the IBMQExperiment workflow has to change a little to enable checkpointing - a minor pain point that I hope is well worth it for most users.

Remaining Tasks

After a round of feedback, here's the current list of additional tasks:

Check that instruments work with this
Make checkpointing the default and in line with the new GST checkpointing kwarg, if possible
Add unit tests using Qiskit mock server
Use new Qiskit Sessions to future-proof against eventual IBM Provider deprecation

Checkpointing is facilitated by moving this from a dict to a class that inherits from TreeNode and serializes in a more "pyGSTi-like" way. All major IBMQExperiment stages (transpile, submit, retrieve results) are accompanied by internal writes of the ibmqexperiment directory, which can be loaded as checkpoints.
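The write-after-every-stage pattern can be sketched with a toy class. Everything here is hypothetical (ToyExperiment, the state.json file name, the placeholder work in each stage); it only illustrates how a directory written after each stage doubles as a loadable checkpoint.

```python
import json
import tempfile
from pathlib import Path


class ToyExperiment:
    """Hypothetical staged experiment that checkpoints after every stage."""

    def __init__(self, circuits, checkpoint_path=None, disable_checkpointing=False):
        self.state = {"circuits": list(circuits), "transpiled": [],
                      "job_ids": [], "results": []}
        # No writes happen if checkpointing is disabled or no path is given.
        self.checkpoint_path = None if disable_checkpointing else checkpoint_path

    def _checkpoint(self):
        if self.checkpoint_path is None:
            return
        root = Path(self.checkpoint_path)
        root.mkdir(parents=True, exist_ok=True)
        (root / "state.json").write_text(json.dumps(self.state))

    def transpile(self):
        # Placeholder for real transpilation work.
        self.state["transpiled"] = [c.lower() for c in self.state["circuits"]]
        self._checkpoint()

    def submit(self):
        self.state["job_ids"] = [f"job-{i}" for i in range(len(self.state["transpiled"]))]
        self._checkpoint()

    def retrieve_results(self):
        self.state["results"] = [{"job": j, "counts": {}} for j in self.state["job_ids"]]
        self._checkpoint()

    @classmethod
    def from_dir(cls, path):
        # The checkpoint is a complete on-disk copy, so loading it resumes the run.
        exp = cls([], checkpoint_path=path)
        exp.state = json.loads((Path(path) / "state.json").read_text())
        return exp


ckpt = Path(tempfile.mkdtemp()) / "toy_checkpoint"
exp = ToyExperiment(["GxGy", "GyGx"], checkpoint_path=ckpt)
exp.transpile()
exp.submit()

resumed = ToyExperiment.from_dir(ckpt)  # picks up after submit()
resumed.retrieve_results()
```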

…rimentDesign The thought process is that in many cases, the all_circuits_needing_data for CombinedExperimentDesign is simply a union of subdesign lists. In this case, a user can opt out of saving this and just regenerate it on deserialization (thereby saving 2x disk space and save/load time).

rileyjmurray · 2023-12-13T12:23:50Z

@sserita, I got a ping about reviewing this due to being a code owner. From what I can see this doesn't touch code that I have prior experience with. Maybe a change in my team in the CODEOWNERS file is in order?

coreyostrove · 2023-12-14T23:27:52Z

pygsti/extras/ibmq/ibmqcore.py

+            self.auxfile_types['batch_results'] = 'pickle'
+
+        if not disable_checkpointing:
+            if self.checkpoint_path is None:


Since we're enabling checkpointing by default, I think it would be worthwhile to implement a default path here for when the checkpoint_path kwarg isn't set by a user. This is something we currently do in the GST protocol checkpointing code, fwiw, in case referencing how it is handled in that part of the code would be useful.

I guess that is fair. One reason I did it this way is that the "checkpoint" here is actually a completely valid IBMQExperiment object that can be loaded using from_dir(), so in my experience so far it has been nice to have it match the dirname for write() so that the user doesn't have to change how they load from checkpoint vs a .write()... but it's a good point that a default path could simplify this for users. I'll think about how I want to incorporate that here.
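One way the default-path suggestion could look, as a sketch only. resolve_checkpoint_path is a hypothetical helper, and the default directory name is an assumption borrowed from an example later in this thread rather than a confirmed default.

```python
from pathlib import Path

# Assumed default name, for illustration only.
DEFAULT_CHECKPOINT_DIR = "ibmqexperiment_checkpoint"


def resolve_checkpoint_path(checkpoint_path=None, disable_checkpointing=False):
    """Return the directory to checkpoint into, or None when disabled."""
    if disable_checkpointing:
        return None
    if checkpoint_path is None:
        # Fall back to a default directory under the current working directory.
        return Path.cwd() / DEFAULT_CHECKPOINT_DIR
    return Path(checkpoint_path)


default_path = resolve_checkpoint_path()
explicit_path = resolve_checkpoint_path("my_run")
disabled_path = resolve_checkpoint_path("my_run", disable_checkpointing=True)
```

Keeping an explicit checkpoint_path equal to the directory later passed to write() would preserve the property mentioned in the reply: the checkpoint and the final write load the same way.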

coreyostrove · 2023-12-14T23:47:15Z

pygsti/protocols/protocol.py

    def __init__(self, sub_designs, all_circuits=None, qubit_labels=None, sub_design_dirs=None,
-                 interleave=False):
+                 interleave=False, skip_writing_all_circuits=False):
        """


I think the general idea of having the option to skip writing all_circuits is alright, but having this be something that is specified as an attribute of a class instance during the constructor stage feels suboptimal. This feels like something that would fit better as an optional flag passed into the write method. Of course, doing so would mean you'd need to implement an overriding version of write specific to CombinedExperimentDesign. Then again it looks like this is the tactic that was taken for the FreeformDesign, so maybe that isn't so bad?

I actually tried that first, but it got a little nasty because a) it was doing a lot of manipulation of auxfile_types on the fly, and b) I really want to trigger that option as part of ProtocolData, IBMQExperiment, etc and that would mean plumbing up an option that was only used for CombinedExperimentDesign through everything that could possibly hold an edesign... Any suggestions on avoiding that plumbing are welcome, the class member was my first pass at it but I agree it's pretty awkward.

Actually I'm not sure why I didn't think of this, but I can just check whether all_circuits_needing_data matches the set of all subdesigns... Duh. (Idea came from your assert comment below.)
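The automatic check floated here might look like the sketch below. It uses plain lists and a dict of sub-designs; union_of_subdesigns and can_skip_writing_all_circuits are hypothetical names, and the order-sensitive comparison is an assumption chosen so that regenerating the list on load reproduces it exactly.

```python
def union_of_subdesigns(sub_designs):
    """Concatenate sub-design circuit lists, dropping repeats but keeping order."""
    seen, merged = set(), []
    for circuits in sub_designs.values():
        for circ in circuits:
            if circ not in seen:
                seen.add(circ)
                merged.append(circ)
    return merged


def can_skip_writing_all_circuits(all_circuits, sub_designs):
    # Only skip the write when regeneration would give back the identical list.
    return list(all_circuits) == union_of_subdesigns(sub_designs)


subs = {"rb": ["c0", "c1"], "gst": ["c1", "c2"]}
skip_plain = can_skip_writing_all_circuits(["c0", "c1", "c2"], subs)
skip_custom = can_skip_writing_all_circuits(["c2", "c0"], subs)
```

With a check like this there is no flag to plumb through ProtocolData or IBMQExperiment; the decision is made inside the design at write time.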

coreyostrove · 2023-12-15T00:10:08Z

Great work, @sserita! I posted a couple of code specific comments above, but had a couple of broader comments to go along with them. I think the main thing that this PR is missing is some unit tests. As far as I can tell we currently have no coverage of the IBMQExperiment in any of our test modules. More and more users are reliant on this code so this would be a good opportunity to at least start to fill that gap by adding some tests for the new checkpointing functionality. Thoughts on this? We obviously don't want to have the runners making a bunch of API calls to IBM's servers as part of testing, but I took a look at the code and at first glance it looks like the transpile method doesn't make any external API calls, so this could be a good candidate for testing. I forget off-hand whether using the simulator backend runs the simulations locally or on their cloud, but if it is locally that might be a good option for extending coverage to things like the submit method.

Relatedly, while we have some coverage on serialization for experiment designs, it looks like we presently don't have anything for FreeformDesign (we actually only have one test that touches FreeformDesign at all, and that is related to a qubit label mapping method), so this could be a good thing to add. Ditto for the new CombinedExperimentDesign serialization option (for skipping all_circuits).

coreyostrove · 2023-12-15T00:43:17Z

pygsti/protocols/protocol.py

+
+        # Don't save all_circuits_needing_data, it's redundant with aux_info keys
+        self.auxfile_types['all_circuits_needing_data'] = 'reset'
+        # Currently not jsonable, but will be fixed in write()


Would it be possible to add an assertion or other check here that verifies this is true? That is, that self.aux_info.keys() is equivalent to all_circuits_needing_data? This is definitely true initially upon construction, but it also looks like it is possible for these to become out of sync during the course of standard usage. For example, the base ExperimentDesign class has the method truncate_to_circuits, which is a wrapper around _truncate_to_circuits_inplace. Looking at the implementation of _truncate_to_circuits_inplace, this operates directly on all_circuits_needing_data and we never re-sync this with aux_info. There are a couple of other methods where this happens as well.

Good point. I think probably other things would break if they got out of sync too, but an assert here to check is very easy.
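A minimal sketch of such a check, using plain Python containers. assert_aux_info_in_sync is a hypothetical helper, not the assertion that ended up in the PR, and the truncation below only mimics the kind of drift described above.

```python
def assert_aux_info_in_sync(aux_info, all_circuits_needing_data):
    """Raise if the aux_info keys and the circuit list have drifted apart."""
    aux_keys = list(aux_info.keys())
    circuits = list(all_circuits_needing_data)
    if aux_keys != circuits:
        missing = [c for c in circuits if c not in aux_info]
        extra = [k for k in aux_keys if k not in circuits]
        raise AssertionError(
            f"aux_info is out of sync with all_circuits_needing_data: "
            f"missing keys {missing}, extra keys {extra}"
        )


aux_info = {"c0": {"id": 0}, "c1": {"id": 1}, "c2": {"id": 2}}
circuits = list(aux_info)
assert_aux_info_in_sync(aux_info, circuits)  # passes right after construction

# Mimic an in-place truncation that touches only the circuit list.
circuits = circuits[:2]
try:
    assert_aux_info_in_sync(aux_info, circuits)
    drift_detected = False
except AssertionError:
    drift_detected = True
```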

coreyostrove

First time using the reviewer interface, so didn't realize my other comments probably should have gone in here, sorry!
See above comments for more.

pcwysoc · 2025-04-22T19:37:36Z

@sserita This is non-urgent, please ignore until not OOO. I'm running into a bug when attempting to transpile edesigns with qubit labels that do not start at qubit 0 (i.e., qubit_labels = ('Q27', ) ):

QASM2ParseError Traceback (most recent call last)
Cell In[13], line 1
----> 1 exp1Q.transpile(backend)

File ~/pyGSTi/pygsti/extras/ibmq/ibmqexperiment.py:567, in IBMQExperiment.transpile(self, ibmq_backend, qiskit_pass_kwargs, qasm_convert_kwargs, num_workers)
563 # Run in parallel (p.imap) with progress bars (tqdm)
564 #with _mp.Pool(num_workers) as p:
565 # isa_circuits = list(_tqdm.tqdm(p.imap(task_fn, tasks), total=len(tasks)))
566 for task in _tqdm.tqdm(tasks):
--> 567 self.qiskit_isa_circuit_batches.append(task_fn(task))
569 # Save single batch
570 chkpt_path = _pathlib.Path(self.checkpoint_path) / "ibmqexperiment"

File ~/pyGSTi/pygsti/extras/ibmq/ibmqexperiment.py:63, in _transpile_batch(circs, pass_manager, qasm_convert_kwargs)
60 for circ in circs:
61 # TODO: Replace this with direct to qiskit
62 pygsti_openqasm_circ = circ.convert_to_openqasm(**qasm_convert_kwargs)
---> 63 qiskit_qc = _qiskit.QuantumCircuit.from_qasm_str(pygsti_openqasm_circ)
64 batch.append(qiskit_qc)
66 # Run pass manager on batch

File /opt/homebrew/Caskroom/miniconda/base/envs/pygsti_1_new_qiskit/lib/python3.10/site-packages/qiskit/circuit/quantumcircuit.py:4025, in QuantumCircuit.from_qasm_str(qasm_str)
4022 # pylint: disable=cyclic-import
4023 from qiskit import qasm2
-> 4025 return qasm2.loads(
4026 qasm_str,
4027 include_path=qasm2.LEGACY_INCLUDE_PATH,
4028 custom_instructions=qasm2.LEGACY_CUSTOM_INSTRUCTIONS,
4029 custom_classical=qasm2.LEGACY_CUSTOM_CLASSICAL,
4030 strict=False,
4031 )

File /opt/homebrew/Caskroom/miniconda/base/envs/pygsti_1_new_qiskit/lib/python3.10/site-packages/qiskit/qasm2/__init__.py:587, in loads(string, include_path, custom_instructions, custom_classical, strict)
571 """Parse an OpenQASM 2 program from a string into a :class:`.QuantumCircuit`.
572
573 Args:
(...)
584 A circuit object representing the same OpenQASM 2 program.
585 """
586 custom_instructions = list(custom_instructions)
--> 587 return _parse.from_bytecode(
588 _qasm2.bytecode_from_string(
589 string,
590 [_normalize_path(path) for path in include_path],
591 [
592 _qasm2.CustomInstruction(x.name, x.num_params, x.num_qubits, x.builtin)
593 for x in custom_instructions
594 ],
595 tuple(custom_classical),
596 strict,
597 ),
598 custom_instructions,
599 )

File /opt/homebrew/Caskroom/miniconda/base/envs/pygsti_1_new_qiskit/lib/python3.10/site-packages/qiskit/qasm2/parse.py:211, in from_bytecode(bytecode, custom_instructions)
208 # Pull this out as an explicit iterator so we can manually advance the loop in DeclareGate
209 # contexts easily.
210 bc = iter(bytecode)
--> 211 for op in bc:
212 # We have to check op.opcode so many times, it's worth pulling out the extra attribute
213 # access. We should check the opcodes in order of their likelihood to be in the OQ2 program
214 # for speed. Gate applications are by far the most common for long programs. This function
215 # is deliberately long and does not use hashmaps or function lookups for speed in
216 # Python-space.
217 opcode = op.opcode
218 # OpCode is an enum in Rust, but its instances don't have the same singleton property as
219 # Python enum.Enum objects.

QASM2ParseError: ":7,5: index 27 is out-of-range for register 'q' of size 1"
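The error message is consistent with the label 'Q27' being turned into register index 27 while the register is sized by the number of qubits in the design (1). Whether that is what happened here is an assumption; the sketch below only contrasts the two index conventions, and all three helper names are hypothetical.

```python
def label_suffix_index(label):
    """Index read from the digits of the label, e.g. 'Q27' -> 27."""
    return int("".join(ch for ch in label if ch.isdigit()))


def positional_index(label, qubit_labels):
    """Index given by the label's position within the design's qubit labels."""
    return list(qubit_labels).index(label)


def fits_register(index, register_size):
    return 0 <= index < register_size


qubit_labels = ("Q27",)
register_size = len(qubit_labels)  # a one-qubit design gets a register of size 1

suffix_ok = fits_register(label_suffix_index("Q27"), register_size)
positional_ok = fits_register(positional_index("Q27", qubit_labels), register_size)
```

Under the suffix convention the register would need at least 28 entries, whereas the positional convention always fits a register sized by the number of labels.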

pcwysoc · 2025-04-22T19:39:09Z

@sserita Also, I wanted to check to make sure the bug with using add_count_list to populate the dataset was fixed (we had been using add_count_dict as a work around). This is non-urgent, please ignore until no longer OOO.

ndsieki · 2025-08-05T17:43:42Z

There is an issue with the checkpointing (or possibly the reloading) of pyGSTi circuits with float args. Here is an example:

from pygsti.extras.ibmq import IBMQExperiment
from pygsti.baseobjs.label import Label
from pygsti.circuits import Circuit
from pygsti.protocols import FreeformDesign

starting_circ = Circuit([Label('Gu3', ['Q0'], args=[1.2e-5, 1.0, 3.14])])
circ_dict = {starting_circ: 'bug'}

edesign = FreeformDesign(circ_dict)
circ_after_edesign = edesign.all_circuits_needing_data[0]

exp = IBMQExperiment(edesign, pspec=None)
circ_after_exp = exp.edesign.all_circuits_needing_data[0]

exp2 = IBMQExperiment.from_dir('ibmqexperiment_checkpoint')
circ_after_reload_exp = exp2.edesign.all_circuits_needing_data[0]

print(starting_circ.layer_label(0).args)
print(circ_after_edesign.layer_label(0).args)
print(circ_after_exp.layer_label(0).args)
print(circ_after_reload_exp.layer_label(0).args)

Output:

(1.2e-05, 1.0, 3.14)
(1.2e-05, 1.0, 3.14)
(1.2e-05, 1.0, 3.14)
('1.2e-05', 1.0, 3.14)

The string cast is undesirable.
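One way a cast like this can arise is a number pattern that does not cover scientific notation, so the token falls through and stays a string. The sketch below is illustrative only and is not pyGSTi's parser; naive_parse_arg and parse_arg are hypothetical names.

```python
import re

# Matches plain decimals such as "3" or "3.14" but not "1.2e-05".
_PLAIN_DECIMAL = re.compile(r"[+-]?\d+(\.\d+)?")


def naive_parse_arg(token):
    """Convert only plain decimals; anything else is left as a string."""
    if _PLAIN_DECIMAL.fullmatch(token):
        return float(token)
    return token


def parse_arg(token):
    """Let int() and float() decide, which also covers exponent notation."""
    for cast in (int, float):
        try:
            return cast(token)
        except ValueError:
            continue
    return token  # genuine strings are left untouched


tokens = ["1.2e-05", "1.0", "3.14"]
naive_args = tuple(naive_parse_arg(t) for t in tokens)
fixed_args = tuple(parse_arg(t) for t in tokens)
```

The naive version reproduces the shape of the reported output, with a string in the first slot and floats in the other two.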

ndsieki · 2025-08-06T22:13:35Z

Additionally, if one looks at what happens to the batch_results attribute (pattern matching off the code above):

print(batch_results_after_exp)
print(batch_results_after_reload_exp)

Output:

[]
None

If batch_results is None and IBMQExperiment.retrieve_results() is called, errors occur due to the expectation that batch_results is a list. For example, see lines 290 and 300 in ibmqexperiment.py.
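A small sketch of the kind of guard that avoids this, assuming the loader can normalize the value after reading it from disk. normalize_batch_results is a hypothetical helper, not a pyGSTi function.

```python
def normalize_batch_results(value):
    """Treat a missing (None) batch_results as an empty list."""
    return [] if value is None else list(value)


reloaded = normalize_batch_results(None)
# Code that expects a list, such as appending a new batch, now works.
reloaded.append({"batch": 0, "counts": {}})

untouched = normalize_batch_results([{"batch": 0}])
```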

coreyostrove · 2025-08-12T05:00:00Z

This is non-urgent, please ignore until not OOO. I'm running into a bug when attempting to transpile edesigns with qubit labels that do not start at qubit 0 (i.e., qubit_labels = ('Q27', ) ):

@pcwysoc: I tried to reproduce this error on this branch and wasn't able to do so. If you encounter this error again or are able to reproduce it please open up a github issue with the steps to do so.

coreyostrove · 2025-08-12T05:01:01Z

Also, I wanted to check to make sure the bug with using add_count_list to populate the dataset was fixed (we had been using add_count_dict as a work around). This is non-urgent, please ignore until no longer OOO.

@pcwysoc: Can you tell me more about the bug that you're referring to?

coreyostrove · 2025-08-12T21:57:41Z

There is an issue with the checkpointing (or possibly the reloading) of pyGSTi circuits with float args. [...] The string cast is undesirable.
Additionally, [...] If batch_results is None and IBMQExperiment.retrieve_results() is called, errors occur due to the expectation that batch_results is a list.

These have now been patched.

coreyostrove

Fantastic work, thanks for the significant effort making these changes!

pcwysoc · 2025-08-13T16:15:10Z

Woohoo!

Also, I wanted to check to make sure the bug with using add_count_list to populate the dataset was fixed (we had been using add_count_dict as a work around). This is non-urgent, please ignore until no longer OOO.

@pcwysoc: Can you tell me more about the bug that you're referring to?

Believe this was fixed in a different patch.

sserita added 9 commits November 21, 2023 15:20

First pass at IBMQ checkpointing.

c78233e

First pass reworking IBMQExperiment

db63eb8

The goal is to add checkpointing, which will be facilitated by making this properly serializable.

Merge branch 'develop' into feature-svb-qol-updates

368e31c

Updates to FreeformDesign serialization

e5ac246

Avoid pickling and don't write all_circuits_needing_data. Should cut on-disk space in half, and circuits can be reinitialized from keys (should also save on circ construction time)

Further update to FreeformDesign serialization

7962383

Fix new FreeformDesign serialization

fad56e2

Add deserialization support for old pickle format

2782417

Clean up tutorial.

b325ef7

sserita requested a review from enielse December 2, 2023 06:35

sserita linked an issue Dec 2, 2023 that may be closed by this pull request

Enable IBMQExperiment checkpointing #327

Closed

sserita removed a link to an issue Dec 2, 2023

Enable IBMQExperiment checkpointing #327

Closed

sserita self-assigned this Dec 11, 2023

sserita linked an issue Dec 11, 2023 that may be closed by this pull request

Enable IBMQExperiment checkpointing #327

Closed

sserita marked this pull request as draft December 12, 2023 17:45

sserita added 4 commits December 12, 2023 13:05

Make IBMQExperiment chkpting in line with GST chkpting

f8fda1f

Finish docstring

df4bcc8

Bugfixes for serialization updates

8f5b05a

sserita marked this pull request as ready for review December 12, 2023 22:40

sserita requested review from coreyostrove and rileyjmurray as code owners December 12, 2023 22:40

sserita added this to the 0.9.12.1 milestone Dec 12, 2023

sserita changed the title ~~IBMQExperiment and FreeformDesign quality-of-life updates~~ IBMQExperiment and FreeformDesign/CombinedExperimentDesign quality-of-life updates Dec 12, 2023

coreyostrove reviewed Dec 14, 2023

View reviewed changes

coreyostrove reviewed Dec 15, 2023

View reviewed changes

coreyostrove requested changes Dec 15, 2023

View reviewed changes

sserita modified the milestones: 0.9.13, 0.9.13.1 Jan 16, 2025

Add option to use prior session

ee3b66b

sserita modified the milestones: 0.9.13.1, 0.9.13.2 Mar 18, 2025

sserita added 2 commits March 18, 2025 15:09

Merge branch 'develop' into feature-svb-qol-updates

7983d8e

Finish merge for pyproject.toml

c7764e7

tjproct added 2 commits June 30, 2025 21:36

Updated for QisKit 2.1.0

f6a62ed

Updated for QisKit 2.1.0

6cd1922

ndsieki mentioned this pull request Aug 11, 2025

Mirror circuit fidelity estimation support and introduction of benchmarking interface (name TBD) #628

Merged

Corey Ostrove added 3 commits August 11, 2025 21:58

CombinedExperimentDesign Bug

20c3645

Fix a bug in the default subdesign naming of CombinedExperimentDesign. It previously used the '*' character, which is a forbidden filename/directory character on Windows and led to problems writing to disk.
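For illustration, a name can be made safe for Windows by replacing the characters that Windows forbids in file and directory names. safe_dirname and the example name are hypothetical; the actual default naming scheme in this commit is not shown here.

```python
import re

# Characters Windows does not allow in file or directory names.
_WINDOWS_FORBIDDEN = re.compile(r'[<>:"/\\|?*]')


def safe_dirname(name, replacement="_"):
    """Replace characters that are invalid in Windows file/directory names."""
    return _WINDOWS_FORBIDDEN.sub(replacement, name)


cleaned = safe_dirname("design*0")        # hypothetical default-style name
unchanged = safe_dirname("subdesign_0")   # already safe, left as-is
```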

Update unit tests

f798bbe

Update the IBMQ unit tests for compatibility with the newest Qiskit and add a pair of new integration tests covering commonly used end-to-end IBMQ workflows (MRB and MCM GST).

Clean up commented out code

128ab78

Corey Ostrove added 3 commits August 11, 2025 23:10

Merge branch 'develop' into feature-svb-qol-updates

2ca867b

Patch parsing of float arguments in circuits

0a405bb

Fix a bug in parsing quantum circuits with arguments when using floats in scientific notation.

Update deserialization

b91158d

Update the deserialization of batch_results to return an empty list when deserialized as None.

Temporarily turn off dataframe tests

d6abea6

Some new changes in Python 3.12 seem to have created an incompatibility with the dataframe conversion logic using pandas. This needs to be tracked down, but for now these tests are temporarily turned off.

coreyostrove approved these changes Aug 13, 2025

View reviewed changes

coreyostrove merged commit bcb8ac2 into develop Aug 13, 2025
4 checks passed

coreyostrove deleted the feature-svb-qol-updates branch August 13, 2025 16:14

coreyostrove mentioned this pull request Aug 15, 2025

Pandas-related incompatibility on python 3.12 with newer versions of pandas #636

Closed

coreyostrove mentioned this pull request Aug 22, 2025

Enable IBMQExperiment checkpointing #327

Closed

6 participants
