.. DO NOT EDIT. .. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY. .. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE: .. "auto_examples/memory_basic_usage.py" .. LINE NUMBERS ARE GIVEN BELOW. .. only:: html .. note:: :class: sphx-glr-download-link-note :ref:`Go to the end ` to download the full example code. .. rst-class:: sphx-glr-example-title .. _sphx_glr_auto_examples_memory_basic_usage.py: ======================== How to use joblib.Memory ======================== This example illustrates the usage of :class:`joblib.Memory` with both functions and methods. .. GENERATED FROM PYTHON SOURCE LINES 12-17 Without :class:`joblib.Memory` ############################################################################## ``costly_compute`` emulates a computationally expensive process which later will benefit from caching using :class:`joblib.Memory`. .. GENERATED FROM PYTHON SOURCE LINES 17-28 .. code-block:: Python import time import numpy as np def costly_compute(data, column_index=0): """Simulate an expensive computation""" time.sleep(5) return data[column_index] .. GENERATED FROM PYTHON SOURCE LINES 29-32 Be sure to set the random seed to generate deterministic data. Indeed, if the data is not deterministic, the :class:`joblib.Memory` instance will not be able to reuse the cache from one run to another. .. GENERATED FROM PYTHON SOURCE LINES 32-42 .. code-block:: Python rng = np.random.RandomState(42) data = rng.randn(int(1e5), 10) start = time.time() data_trans = costly_compute(data) end = time.time() print('\nThe function took {:.2f} s to compute.'.format(end - start)) print('\nThe transformed data are:\n {}'.format(data_trans)) .. rst-class:: sphx-glr-script-out .. code-block:: none The function took 5.00 s to compute. The transformed data are: [ 0.49671415 -0.1382643 0.64768854 1.52302986 -0.23415337 -0.23413696 1.57921282 0.76743473 -0.46947439 0.54256004] .. GENERATED FROM PYTHON SOURCE LINES 43-50 Caching the result of a function to avoid recomputing ############################################################################## If we need to call our function several time with the same input data, it is beneficial to avoid recomputing the same results over and over since it is expensive. :class:`joblib.Memory` enables to cache results from a function into a specific location. .. GENERATED FROM PYTHON SOURCE LINES 50-70 .. code-block:: Python from joblib import Memory location = './cachedir' memory = Memory(location, verbose=0) def costly_compute_cached(data, column_index=0): """Simulate an expensive computation""" time.sleep(5) return data[column_index] costly_compute_cached = memory.cache(costly_compute_cached) start = time.time() data_trans = costly_compute_cached(data) end = time.time() print('\nThe function took {:.2f} s to compute.'.format(end - start)) print('\nThe transformed data are:\n {}'.format(data_trans)) .. rst-class:: sphx-glr-script-out .. code-block:: none The function took 5.01 s to compute. The transformed data are: [ 0.49671415 -0.1382643 0.64768854 1.52302986 -0.23415337 -0.23413696 1.57921282 0.76743473 -0.46947439 0.54256004] .. GENERATED FROM PYTHON SOURCE LINES 71-74 At the first call, the results will be cached. Therefore, the computation time corresponds to the time to compute the results plus the time to dump the results into the disk. .. GENERATED FROM PYTHON SOURCE LINES 74-82 .. code-block:: Python start = time.time() data_trans = costly_compute_cached(data) end = time.time() print('\nThe function took {:.2f} s to compute.'.format(end - start)) print('\nThe transformed data are:\n {}'.format(data_trans)) .. rst-class:: sphx-glr-script-out .. code-block:: none The function took 0.01 s to compute. The transformed data are: [ 0.49671415 -0.1382643 0.64768854 1.52302986 -0.23415337 -0.23413696 1.57921282 0.76743473 -0.46947439 0.54256004] .. GENERATED FROM PYTHON SOURCE LINES 83-86 At the second call, the computation time is largely reduced since the results are obtained by loading the data previously dumped to the disk instead of recomputing the results. .. GENERATED FROM PYTHON SOURCE LINES 88-94 Using :class:`joblib.Memory` with a method ############################################################################## :class:`joblib.Memory` is designed to work with functions with no side effects. When dealing with class, the computationally expensive part of a method has to be moved to a function and decorated in the class method. .. GENERATED FROM PYTHON SOURCE LINES 94-120 .. code-block:: Python def _costly_compute_cached(data, column): time.sleep(5) return data[column] class Algorithm(object): """A class which is using the previous function.""" def __init__(self, column=0): self.column = column def transform(self, data): costly_compute = memory.cache(_costly_compute_cached) return costly_compute(data, self.column) transformer = Algorithm() start = time.time() data_trans = transformer.transform(data) end = time.time() print('\nThe function took {:.2f} s to compute.'.format(end - start)) print('\nThe transformed data are:\n {}'.format(data_trans)) .. rst-class:: sphx-glr-script-out .. code-block:: none The function took 5.02 s to compute. The transformed data are: [ 0.49671415 -0.1382643 0.64768854 1.52302986 -0.23415337 -0.23413696 1.57921282 0.76743473 -0.46947439 0.54256004] .. GENERATED FROM PYTHON SOURCE LINES 121-129 .. code-block:: Python start = time.time() data_trans = transformer.transform(data) end = time.time() print('\nThe function took {:.2f} s to compute.'.format(end - start)) print('\nThe transformed data are:\n {}'.format(data_trans)) .. rst-class:: sphx-glr-script-out .. code-block:: none The function took 0.01 s to compute. The transformed data are: [ 0.49671415 -0.1382643 0.64768854 1.52302986 -0.23415337 -0.23413696 1.57921282 0.76743473 -0.46947439 0.54256004] .. GENERATED FROM PYTHON SOURCE LINES 130-132 As expected, the second call to the ``transform`` method load the results which have been cached. .. GENERATED FROM PYTHON SOURCE LINES 134-136 Clean up cache directory ############################################################################## .. GENERATED FROM PYTHON SOURCE LINES 136-138 .. code-block:: Python memory.clear(warn=False) .. rst-class:: sphx-glr-timing **Total running time of the script:** (0 minutes 15.092 seconds) .. _sphx_glr_download_auto_examples_memory_basic_usage.py: .. only:: html .. container:: sphx-glr-footer sphx-glr-footer-example .. container:: sphx-glr-download sphx-glr-download-jupyter :download:`Download Jupyter notebook: memory_basic_usage.ipynb ` .. container:: sphx-glr-download sphx-glr-download-python :download:`Download Python source code: memory_basic_usage.py ` .. only:: html .. rst-class:: sphx-glr-signature `Gallery generated by Sphinx-Gallery `_