Working with expensive notebooks¶
marimo provides tools to control when cells run. Use these tools to prevent expensive cells, which may call APIs or take a long time to run, from accidentally running.
Stop execution with mo.stop
¶
Use mo.stop
to stop a cell from executing if a condition
is met:
# if condition is True, the cell will stop executing after mo.stop() returns
mo.stop(condition)
# this won't be called if condition is True
expensive_function_call()
Use mo.stop
with
mo.ui.run_button()
to require a button press for
expensive cells:
Configure how marimo runs cells¶
Disable cell autorun¶
If you habitually work with very expensive notebooks, you can disable automatic execution. When automatic execution is disabled, when you run a cell, marimo marks dependent cells as stale instead of running them automatically.
Disable autorun on startup¶
marimo autoruns notebooks on startup, with marimo edit notebook.py
behaving
analogously to python notebook.py
. This can also be disabled through the
notebook settings.
Disable individual cells¶
marimo lets you temporarily disable cells from automatically running. This is helpful when you want to edit one part of a notebook without triggering execution of other parts. See the reactivity guide for more info.
Manage memory¶
Here are a few tips for managing the memory consumption of your notebooks, on host or GPU.
Wrap intermediate computations in functions¶
By default, global variables live in the kernel memory. Intermediate variables that are defined in functions are cleaned up automatically.
For example, if X
is a temporary:
Do this:
Don't do this:
Use del
to remove variables from kernel memory¶
Use the del
operator to remove variables from kernel memory.
In a single cell. Prefer deleting variables in the cell they were defined
in. For example,
if X
is a temporary that you don't need after computing Y
:
In another cell. Sometimes, computations are spread across multiple cells,
and you only realize later on that you need to free memory that you've already
allocated. In such cases you can still use the del
keyword. For example:
marimo inserts control dependences to make sure that variables are not deleted
before they are used. When del
is used to delete a variable that was defined
in a another cell, the cell where del
was used becomes a child of all other
cells that reference that variable. In this case, that means marimo knows to
run the third cell after the second cell, since the second cell references
data
and the third cell deletes it. However, once data
is deleted,
attempting to manually run the second cell will raise a NameError
, and you'll
need to re-run the defining cell in order to get your notebook back to a
consistent state.
Automatically snapshot outputs as HTML or IPYNB¶
To keep a record of your cell outputs while working on your
notebook, you can configure notebooks to automatically save as HTML or ipynb
through the notebook menu (these files are saved in addition to the
notebook's .py
file). Snapshots are saved to a folder called
__marimo__
in the notebook directory.
Learn more about exporting notebooks in our exporting guide.
Cache expensive computations¶
marimo provides two decorators to cache the return values of expensive functions:
- In-memory caching with
mo.cache
- Disk caching with
mo.persistent_cache
Both utilities can be used as decorators or context managers.
See our guide on caching for details, including how the cache key is constructed, and limitations.
Lazy-load expensive UIs¶
Lazily render UI elements that are expensive to compute using
marimo.lazy
.
For example,
In this example, mo.ui.table(data)
will not be rendered on the frontend until is it in the viewport.
For example, an element can be out of the viewport due to scroll, inside a tab that is not selected, or inside an accordion that is not open.
However, in this example, data is eagerly computed, while only the rendering of the table is lazy. It is possible to lazily compute the data as well: see the next example.
import marimo as mo
def expensive_component():
import time
time.sleep(1)
data = db.query("SELECT * FROM data")
return mo.ui.table(data)
accordion = mo.accordion({
"Charts": mo.lazy(expensive_component)
})
In this example, we pass a function to mo.lazy
instead of a component. This
function will only be called when the user opens the accordion. In this way,
expensive_component
lazily computed and we only query the database when the
user needs to see the data. This can be useful when the data is expensive to
compute and the user may not need to see it immediately.