Add cache_method decorator #895

Closed

Conversation

@chriseclectic (Collaborator) commented Aug 19, 2022

Summary

Adds a cache_method decorator that generalizes lru_cache for caching methods of class instances (lru_cache can reportedly leak memory when applied directly to methods, since its function-level cache holds a strong reference to self).

This is based on the solution suggested for caching experiment methods in this comment.

Details and comments

By default this decorator requires all method arg and kwarg values to be hashable, and they are included in the cache key for matching. However, setting cache_args=False on the decorator will ignore args and kwargs and match only on the method name. Alternatively, the decorator can be called with require_hashable=False, which allows non-hashable args while matching on all the hashable args and kwargs.
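
As a rough illustration of the intended behavior, a minimal sketch of such a decorator might look like the following (the key format and cache layout here are assumptions for illustration, not necessarily this PR's implementation; require_hashable is omitted for brevity):

import functools
from typing import Dict, Union

def cache_method(cache: Union[Dict, str] = "_cache", cache_args: bool = True):
    """Cache a bound method's return values without hashing ``self``."""

    def decorator(method):
        name = method.__name__

        @functools.wraps(method)
        def wrapped(self, *args, **kwargs):
            # Resolve the cache dict: either an attribute stored on the
            # instance (so it is freed together with the instance) or an
            # externally supplied dict.
            if isinstance(cache, str):
                if not hasattr(self, cache):
                    setattr(self, cache, {})
                method_cache = getattr(self, cache)
            else:
                method_cache = cache
            # Match on the method name alone, or also on the (hashable)
            # call arguments, depending on ``cache_args``.
            if cache_args:
                key = (name, args, tuple(sorted(kwargs.items())))
            else:
                key = name
            if key not in method_cache:
                method_cache[key] = method(self, *args, **kwargs)
            return method_cache[key]

        return wrapped

    return decorator

Applied as @cache_method() to a regular method, repeated calls with the same arguments return the stored value without re-running the body; because the cache dict lives on the instance rather than on the function object, it is garbage collected with the instance.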

@nkanazawa1989 (Collaborator) left a comment

Thanks Chris. This suggestion seems like a good direction, but we still need to allow for some flexibility. For example, the current framework lacks the capability to check experiment options (self.experiment_options) and to cache outcomes across instances. This goes beyond the requirements of the tomography fitter, but we should provide a flexible API so that #878 can update the mechanism based on its needs.



def cache_method(
    cache: Union[Dict, str] = "_cache", cache_args: bool = True, require_hashable: bool = True
):

Collaborator

The combination of two booleans is a bit hard to understand. Perhaps string-based modes would make the interface more intuitive, such as first_time, hash_all, only_hashable. This would also allow more flexibility in the hashing mechanism; for example, another option we may want is applying repr(arg) to make everything hashable, e.g. try_repr.

Collaborator Author

Personally I find the booleans easier to understand than string values. I don't think we should add more flexibility to the hashing mechanism. If anything, maybe require_hashable should be removed, so the only choice is between using all args (which must then be hashable) or none.

Collaborator

If you remove that option, perhaps this is only applicable to static methods? The self of an experiment instance is not hashable.

Collaborator Author

self not being hashed is by design, and is the main reason you need this decorator instead of lru_cache. So it should only be used with regular methods, not static methods. For a static method you should be able to use a regular lru_cache without issues, since a static method behaves like a plain function.
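
The pitfall can be demonstrated directly with the standard library (Leaky is a throwaway class, purely for illustration): lru_cache on a regular method keys the shared, function-level cache on self, so the cache keeps every instance alive:

import functools
import gc
import weakref

class Leaky:
    @functools.lru_cache(maxsize=None)
    def compute(self, x):
        return 2 * x

obj = Leaky()
obj.compute(1)            # the function-level cache now references ``obj``
ref = weakref.ref(obj)
del obj
gc.collect()
print(ref() is not None)  # True: the cache entry keeps the instance alive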

Collaborator Author

PS @nkanazawa1989 I will make a commit removing the require_hashable kwarg so as not to overcomplicate this, as you suggest.

@chriseclectic (Collaborator Author)

@nkanazawa1989 I think there is a bit of a misunderstanding: this decorator is intended as a replacement for lru_cache that works properly with methods, and it adds some flexibility over whether arg values are included when matching cached values. It has nothing directly to do with caching in BaseExperiment (though caching of transpiled circuits and the like in BaseExperiment could be implemented using this functionality, e.g. as I wrote in the other PR comment).

@nkanazawa1989 (Collaborator) commented Aug 22, 2022

If this is just a bug fix for Python's lru_cache, then I think this code should go to that repository. Since the arguments of your cache function differ from those of functools.lru_cache, I assumed you wanted some customization for our experiments. In that case nothing stops us from adding some flexibility for our experiments, as long as it doesn't hurt performance (e.g. expensive formatting and inspection must be avoided). I think the flexibility to check experiment options from the experiment instance (self) is a reasonable thing to support in our experiment module. Perhaps a boolean still works for this, e.g. check_experiment_options: bool = False.

@yaelbh (Collaborator) commented Aug 22, 2022

I'm not familiar with the tomography use case.

I'm aware of several use cases where we want to refrain from retranspiling. To this end, what we need is not a caching mechanism. Instead, it is sufficient to do the following (a sketch follows the list):

  1. Expose _transpile_circuits (i.e., rename it to transpile_circuits).
  2. Let run accept a transpiled_circuits input parameter.
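
A rough sketch of that interface, assuming illustrative method bodies and signatures rather than the existing BaseExperiment API:

class BaseExperiment:
    def transpile_circuits(self):
        """Transpile the experiment circuits (the renamed _transpile_circuits)."""
        ...

    def run(self, backend=None, transpiled_circuits=None, **run_options):
        # Callers that already hold transpiled circuits pass them in and
        # skip retranspilation entirely.
        if transpiled_circuits is None:
            transpiled_circuits = self.transpile_circuits()
        ...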

@nkanazawa1989 (Collaborator) commented Aug 22, 2022

Sounds like #878 can simply be closed in that case. I don't have any method in BaseExperiment or its subclasses that can be drastically improved with a cache. Exposing transpiled circuits is not in the scope of #878.

(edit)
This makes me think the PR is good to go as it is.

This supports caching regular methods of class instances, with optional support for including hashable arg values in the cache key.
Define the function that returns the method cache dict outside of the wrapped method so it doesn't need to be re-checked on every method call.

def _cache_fn(instance, method):
    # pylint: disable = unused-argument
    name = method.__name__

Collaborator Author

@nkanazawa1989 I wonder if this should be method.__qualname__ instead? __qualname__ includes the class name in the string, like <cls_name>.<method_name>, rather than just <method_name>.
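
For reference, the two attributes differ like this (Example is a throwaway class):

class Example:
    def fit(self):
        pass

print(Example.fit.__name__)      # fit
print(Example.fit.__qualname__)  # Example.fit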

Collaborator

I think qualname would be useful if we want to support a class-level cache in the future. It would also let you validate that the method is not a plain function.

@yaelbh (Collaborator) commented Aug 23, 2022

The proposal of #895 (comment) may require more thinking, because run does things before the transpilation that may be relevant to the transpilation.

Still, I'd like to revisit the caching mechanism that you're building here. You're putting effort into it, and we'll have to maintain it. What's the motivation? Is it justified? It seems to be beyond the scope of qiskit-experiments.

The reuse of transpiled circuits is a modest goal that can be met with a small solution, probably along the lines of #895 (comment), or something else of the same magnitude.

@nkanazawa1989 (Collaborator)

This is a cache mechanism to store some internal state. In QE, some helper functions (methods) may be called multiple times, so it is sometimes better to cache them for performance. For example, dedicated logic that caches only the transpiled circuits doesn't speed up the analysis classes, and we may need a huge amount of memory if a user generates circuits with multiple settings (if we cache in memory).

As Chris wrote in the PR comment, this is mainly to overcome a memory leak issue in the current Python LRU cache (I don't know the details). I think Chris is currently looking into a different approach.

@chriseclectic (Collaborator Author)

Closing this in favor of #997.
