Experiment data fixes #1092

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Merged

coruscating merged 55 commits into qiskit-community:main from gadial:experiment_data_fixes

May 11, 2023

Contributor

gadial commented Mar 19, 2023 •

edited

Loading

Summary

This PR handles some issues related to ExperimentData

Fixing a bug in _add_job_data.
Adding multi-upload capability to ExperimentData.save()
Different provider handling to enable better data loading
start_datetime and end_datetime are not being set at all, and creation_datetime and updated_datetime are being set only after loading the experiment from the server.

Details and comments

Currently _add_job_data is adding the result of a job without explicitly supplying its job_id. While in the old qiskit-ibmq-provider it was ok, in the new qiskit-ibm-provider it seems the job id contained in the Result object is different than the job id of the actual job itself. Since ExperimentData keeps the original job id, the result is that for every submitted job, it ends up with two different ids: One of a seemingly unfinished job, and the second for a job which was seemingly never initiated. This PR addresses this issue by using the original job id whenever possible.
ExperimentData.save() currently uploads both analysis results and figures one-by-one, with the result being inefficient which already affects other projects. This issue is handled in qiskit-ibm-experiments whose API was enlarged to allow multiple uploading of analysis results and figures; this PR enables this API usage in ExperimentData
ExperimentData.load() currently takes the experiment_id and an IBMExperimentService object. This has two setbacks: First, IBMExperimentService should be transparent to the users as much as possible. Second, IBMExperimentService handles the resultDB data, but the job data stored by ExperimentData is handled by the IBMProvider. This issue can be fixed by allowing the provider to be passed as parameter to load() since the service can be obtained from the provider. This change also fixes ExperimentData uses deprecated backend.retrieve_job method #1093.
start_datetime and end_datetime were not set by ExperimentData nor by the database itself. This PR makes the experiment data set start_datetime to the time it was created (unless another value is passed on creation; currently the BaseExperiment creates the experiment data right before beginning the experiment. Also, this PR makes every job update the end_datetime once it terminates. Along with that, calls to save() now update the values of creation_datetime and updated_datetime (which are set by the server). All the times are stored in UTC timezone, but the getters return them in local time, and the setters convert from local time to UTC.
ExperimentData.save() did not raise error in case no database service was available. Now it raises an error if suppress_errors is False.

gadial added 5 commits

March 19, 2023 10:49


          Bug fix. In the new backend provider, the job_id given in the Result …

335782b

…object is not the same as the original job_id, which should be used here.


          Experiment save now uses multiple analysis results create/update methods

6b45253


          Removed bulk update usage as it's not robust enough for now

b9acbf2


          Multi upload added for figures

c15b833


          Added a flag for disabling figure upload

71e82ea

yaelbh mentioned this pull request

ExperimentData uses deprecated backend.retrieve_job method #1093

Closed

gadial and others added 14 commits

March 28, 2023 14:07


          Different handling of provider to enable better data retrievel on load

014aca1


          Merge branch 'main' into experiment_data_fixes

0b0e474


          Merge branch 'experiment_data_fixes' of github.com:gadial/qiskit-expe…

50e5b21

…riments into experiment_data_fixes


          Remove import of qiskit-ibm-provider

8e80440


          Linting

a413828


          Fix a bug when setting provider

87a4ecf


          setting end_datetime when a job terminates.

884cee0


          _add_result_data can now handle missing job_id

5c9fc17


          Linting

d15e003


          Linting

3ed9dc6


          Merge branch 'main' into experiment_data_fixes

4fd5db8


          Bugfix

cc54c5f


          Bugfix in _retrieve_data() and test fix in test_save()

ed372c4


          Merge branch 'main' into experiment_data_fixes

8390c87

gadial changed the title ~~[WIP] Experiment data fixes~~ Experiment data fixes

gadial requested review from coruscating and yaelbh

March 29, 2023 09:10

coruscating reviewed

View reviewed changes

Collaborator

coruscating left a comment

Minor comments for now, will review more closely when we decide on end_datetime.

qiskit_experiments/framework/experiment_data.py Outdated Show resolved Hide resolved

qiskit_experiments/framework/experiment_data.py Outdated Show resolved Hide resolved

qiskit_experiments/framework/experiment_data.py

    
                  @staticmethod

                  def get_service_from_backend(backend):

                  def get_service_from_provider(provider):

Collaborator

coruscating Mar 29, 2023

You should update the documentation that uses get_service_from_backend if you're changing this interface.

Contributor Author

gadial Apr 17, 2023

I think I'll keep both.

qiskit_experiments/framework/experiment_data.py Outdated Show resolved Hide resolved

qiskit_experiments/framework/experiment_data.py Outdated

Comment on lines 966 to 967

    
                      # if self._result_data or not self._backend:

                      #     return

Collaborator

coruscating Mar 29, 2023

Why are these commented out?

Contributor Author

gadial Apr 17, 2023

That's actually a nontrivial issue. _retrieve_data is called from ExperimentData.load() which first initializes a new object via the line expdata = cls(service=service, db_data=data, provider=provider) and then calls expdata._retrieve_data(). However, when initializing the expdata, the _result_data field is also initialized to an empty thread safe dict, so it seems _retrieve_data will never run. I don't see why this line is here.

yaelbh requested changes

View reviewed changes

qiskit_experiments/framework/experiment_data.py Show resolved Hide resolved

qiskit_experiments/framework/experiment_data.py Show resolved Hide resolved

qiskit_experiments/framework/experiment_data.py Show resolved Hide resolved

qiskit_experiments/framework/experiment_data.py Outdated Show resolved Hide resolved

qiskit_experiments/framework/experiment_data.py Show resolved Hide resolved

qiskit_experiments/framework/experiment_data.py Outdated Show resolved Hide resolved

qiskit_experiments/framework/experiment_data.py Outdated Show resolved Hide resolved

qiskit_experiments/framework/experiment_data.py Outdated Show resolved Hide resolved

qiskit_experiments/framework/experiment_data.py Show resolved Hide resolved

qiskit_experiments/framework/experiment_data.py Outdated Show resolved Hide resolved

Collaborator

yaelbh commented Mar 29, 2023

Release notes are missing.

gadial and others added 4 commits

April 17, 2023 15:48


          Merge branch 'main' into experiment_data_fixes

ab11be8


          Fixes according to code review

638e177


          Linting

3c743e5


          Merge branch 'main' into experiment_data_fixes

757370b

gadial and others added 12 commits

May 3, 2023 10:12


          Code review fixes

2d1cc28


          Merge branch 'main' into experiment_data_fixes

dbde205


          Added support for automatically setting start_datetime

36fe440


          Merge branch 'experiment_data_fixes' of github.com:gadial/qiskit-expe…

a55a21b

…riments into experiment_data_fixes


          Better handling of hgp (verifies the backend is in the chosen hgp)

559d41f


          Created and updated datetime are now taken from the server's response

e21f54e


          Linting

f95dfce


          Bugfix

21be727


          All datetime is now stored as UTC, displayed as local

76c39b0


          Merge branch 'main' into experiment_data_fixes

28cbe17


          Updated relesae notes

16fe158


          Linting

127b4ed

yaelbh approved these changes

View reviewed changes

qiskit_experiments/framework/experiment_data.py Outdated Show resolved Hide resolved

qiskit_experiments/framework/experiment_data.py Outdated Show resolved Hide resolved

qiskit_experiments/framework/experiment_data.py Outdated Show resolved Hide resolved

qiskit_experiments/framework/experiment_data.py Outdated Show resolved Hide resolved

gadial and others added 5 commits

May 7, 2023 12:30


          Update qiskit_experiments/framework/experiment_data.py

c1edc72

Co-authored-by: Yael Ben-Haim <[email protected]>


          Update qiskit_experiments/framework/experiment_data.py

9dfd9f2

Co-authored-by: Yael Ben-Haim <[email protected]>


          Update qiskit_experiments/framework/experiment_data.py

092c65b

Co-authored-by: Yael Ben-Haim <[email protected]>


          Update qiskit_experiments/framework/experiment_data.py

1406a1a

Co-authored-by: Yael Ben-Haim <[email protected]>


          Small fix where experiment_type is being set to empty string instea…

36935bd

…d of `None` to avoid failing exp save due to this missing field.

Collaborator

coruscating commented May 10, 2023

I ran a test experiment, but creation_datetime and updated_datetime of the experiment data object were not populated after saving:

child_exp1 = T1(physical_qubits=(3,), delays=np.arange(1e-6, 10e-5, 3e-5))
child_exp2 = StandardRB(physical_qubits=(0,1), lengths=np.arange(1,100,20), num_samples=10)
child_exp3 = StandardRB(physical_qubits=(4,5), lengths=np.arange(1,100,20), num_samples=10)
parallel_exp = ParallelExperiment([child_exp1, child_exp2], flatten_results=True)
batch_exp = BatchExperiment([parallel_exp, child_exp3], flatten_results=True)
parallel_data = batch_exp.run(backend, seed_simulator=101).block_for_results()
parallel_data.save()

# these are still None
print(parallel_data.creation_datetime)
print(parallel_data.updated_datetime)

Also, after loading the experiment data object from ResultsDB, the start_datetime has changed to be later than the start_datetime of the original experiment data object, such that creation_datetime is actually earlier than start_datetime in this test:

end_datetime is nearly the same, I assume the small deviation is due to the server and local clock time difference which is fine.

Contributor Author

gadial commented May 10, 2023

I ran a test experiment, but creation_datetime and updated_datetime of the experiment data object were not populated after saving:

You're not seeing creation_datetime updating probably because you need to update your version of qiskit-ibm-experiment to 0.3.1; I needed a minor fix there for this to work.

updated_datetime is a slightly different story because it is not returned from the server when a new experiment is created, so we'll need to change this one server-side. Try doing parallel_data.save() twice and you'll see the correct value for updated_datetime (given you're using qiskit-ibm-experiment 0.3.1).

gadial added 5 commits

May 10, 2023 09:40


          Merge branch 'main' into experiment_data_fixes

f656498


          Bugfixes related to start_datetime


          Merge branch 'main' into experiment_data_fixes

0e7c89a


          Linting

517e715


          Merge branch 'main' into experiment_data_fixes

d701b47

coruscating approved these changes

View reviewed changes

Collaborator

coruscating left a comment

Thanks for fixing the issues. There are a few remaining problems when running a composite experiment with flatten_results=False:

The child experiments don't have end_datetimes while the parent experiment does
Only one child experiment will have updated_datetime while the other child experiments and the parent don't

These should be addressed in a follow-up PR.

coruscating enabled auto-merge

May 11, 2023 20:38

coruscating added this pull request to the merge queue

Merged via the queue into qiskit-community:main with commit 6a732f4

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet