.. cpp:namespace:: nanobind
Python and C++ don't manage the lifetime and storage of objects in the same way. Consequently, two questions arise whenever an object crosses the language barrier:
- Who actually owns this object? C++? Python? Both?!
- Can we safely determine when it is no longer needed?
This is important: we must exclude the possibility that Python destroys an object that is still being used by C++ (or vice versa).
The :ref:`previous section <exchange>` introduced three ways of exchanging information between C++ and Python: :ref:`type casters <type_casters>`, :ref:`bindings <bindings>`, and :ref:`wrappers <wrappers>`. It is specifically :ref:`bindings <bindings>` for which these two questions must be answered.
Consider the following problematic example to see what can go wrong:
#include <nanobind/nanobind.h>
namespace nb = nanobind;
struct Data { };
Data data; // Data global variable & function returning a pointer to it
Data *get_data() { return &data; }
NB_MODULE(my_ext, m) {
nb::class_<Data>(m, "Data");
// KABOOM, calling this function will crash the Python interpreter
m.def("get_data", &get_data);
}
The bound function my_ext.get_data()
returns a Python object of type
my_ext.Data
that wraps the pointer &data
and takes ownership of it.
When Python eventually garbage collects the object, nanobind will try to free
the (non-heap-allocated) C++ instance via operator delete
, causing a
segmentation fault.
To avoid this problem, we can
- Provide more information: the problem was that nanobind incorrectly transferred ownership of a C++ instance to the Python side. To fix this, we can add add a :ref:`return value policy <rvp>` annotation that clarifies what to do with the return value.
- Make ownership transfer explicit: C++ types passed via :ref:`unique
pointers <unique_ptr>` (
std::unique_ptr<T>
) make the ownership transfer explicit in the type system, which would have revealed the problem in this example. - Switch to shared ownership: C++ types passed via :ref:`shared pointers
<shared_ptr>` (
std::shared_ptr<T>
), or which use :ref:`intrusive reference counting <intrusive_intro>` can be shared by C++ and Python. The whole issue disappears because ownership transfer is no longer needed.
The remainder of this section goes through each of these options.
nanobind provides several return value policy annotations that can be passed to :func:`module_::def`, :func:`class_::def`, and :func:`cpp_function`. The default policy is :cpp:enumerator:`rv_policy::automatic`, which is usually a reasonable default (but not in this case!).
In the :ref:`problematic example <ownership_problem>`, the policy :cpp:enumerator:`rv_policy::reference` should have been specified explicitly so that the global instance is only referenced without any implied transfer of ownership, i.e.:
m.def("get_data", &get_data, nb::rv_policy::reference);
On the other hand, this is not the right policy for many other situations, where ignoring ownership could lead to resource leaks. As a developer using this library, it is important that you familiarize yourself with the different options below. In particular, the following policies are available:
:cpp:enumerator:`rv_policy::take_ownership`: Create a thin Python object wrapper around the returned C++ instance without making a copy and transfer ownership to Python. When the Python wrapper is eventually garbage collected, nanobind will call the C++
delete
operator to free the C++ instance.In the example below, a function uses this policy to transfer ownership of a heap-allocated C++ instance to Python:
m.def("make_data", []{ return new Data(); }, nb::rv_policy::take_ownership);
The return value policy declaration could actually have been omitted here because :cpp:enumerator:`take_ownership <rv_policy::take_ownership>` is the default for pointer return values (see :cpp:enumerator:`automatic <rv_policy::automatic>`).
:cpp:enumerator:`rv_policy::copy`: Copy-construct a new Python object from the C++ instance. The copy will be owned by Python, while C++ retains ownership of the original.
In the example below, a function uses this policy to return a reference to a C++ instance. The owner and lifetime of such a reference may not be clear, so the safest route is to make a copy.
struct A { B &b() { /* .. unknown code .. */ } }; nb::class_<A>(m, "A") .def("b", &A::b, nb::rv_policy::copy);
The return value policy declaration could actually have been omitted here because :cpp:enumerator:`copy <rv_policy::copy>` is the default for lvalue reference return values (see :cpp:enumerator:`automatic <rv_policy::automatic>`).
:cpp:enumerator:`rv_policy::move`: Move-construct a new Python object from the C++ instance. The new object will be owned by Python, while C++ retains ownership of the original (whose contents were likely invalidated by the move operation).
In the example below, a function uses this policy to return a C++ instance by value. The :cpp:enumerator:`copy <rv_policy::copy>` operation mentioned above would also be safe to use, but move construction has the potential of being significantly more efficient.
struct A { B b() { return B(...); } }; nb::class_<A>(m, "A") .def("b", &A::b, nb::rv_policy::move);
The return value policy declaration could actually have been omitted here because :cpp:enumerator:`move <rv_policy::move>` is the default for functions that return by value (see :cpp:enumerator:`automatic <rv_policy::automatic>`).
:cpp:enumerator:`rv_policy::reference`: Create a thin Python object wrapper around the returned C++ instance without making a copy, but do not transfer ownership to Python. nanobind will never call the C++
delete
operator, even when the wrapper expires. The C++ side is responsible for destructing the C++ instance.This return value policy is dangerous and should be used cautiously. Undefined behavior will ensue when the C++ side deletes the instance while it is still being used by Python. If you need to use this policy, combine it with a :cpp:struct:`keep_alive` function binding annotation to manage the lifetime. Or use the simple and safe :cpp:enumerator:`reference_internal <rv_policy::reference_internal>` alternative described next.
Below is an example use of this return value policy to reference a global variable that does not need ownership and lifetime management.
Data data; // This is a global variable m.def("get_data", []{ return &data; }, nb::rv_policy::reference)
:cpp:enumerator:`rv_policy::reference_internal`: A policy for methods that expose an internal field. The lifetime of the field must match that of the parent object.
The policy resembles :cpp:enumerator:`reference <rv_policy::reference>` in that it creates creates a thin Python object wrapper around the returned C++ field without making a copy, and without transferring ownership to Python.
Furthermore, it ensures that the instance owning the field (implicit
this
/self
argument) cannot be garbage collected while an object representing the field is alive.The example below uses this policy to implement a getter that permits mutable access to an internal field.
struct MyClass { public: MyField &field() { return m_field; } private: MyField m_field; }; nb::class_<MyClass>(m, "MyClass") .def("field", &MyClass::field, nb::rv_policy::reference_internal);
More advanced variations of this scheme are also possible using combinations of :cpp:enumerator:`reference <rv_policy::reference>` and the :cpp:struct:`keep_alive` function binding annotation.
:cpp:enumerator:`rv_policy::none`: This is the most conservative policy: it simply refuses the cast unless the C++ instance already has a corresponding Python object, in which case the question of ownership becomes moot.
:cpp:enumerator:`rv_policy::automatic`: This is the default return value policy, which falls back to :cpp:enumerator:`take_ownership <rv_policy::automatic>` when the return value is a pointer, :cpp:enumerator:`move <rv_policy::move>` when it is a rvalue reference, and :cpp:enumerator:`copy <rv_policy::copy>` when it is a lvalue reference.
:cpp:enumerator:`rv_policy::automatic_reference`: This policy matches :cpp:enumerator:`automatic <rv_policy::automatic>` but falls back to :cpp:enumerator:`reference <rv_policy::refernece>` when the return value is a pointer. It is the default for function arguments when calling Python functions from C++ code via :cpp:func:`detail::api::operator()`. You probably won't need to use this policy in your own code.
Passing a STL unique pointer embodies an ownership transfer---a return value
policy annotation is therefore not needed. To bind functions that receive or
return std::unique_ptr<..>
, add the extra include directive
#include <nanobind/stl/unique_ptr.h>
Note
While this this header file technically contains a :ref:`type caster <type_casters>`, it is not affected by their usual limitations (mandatory copy/conversion, inability to mutate function arguments).
Example: The following example binds two functions that create and consume
instances of a C++ type Data
via unique pointers.
#include <nanobind/stl/unique_ptr.h>
namespace nb = nanobind;
NB_MODULE(my_ext, m) {
struct Data { };
nb::class_<Data>(m, "Data");
m.def("create", []() { return std::make_unique<Data>(); });
m.def("consume", [](std::unique_ptr<Data> x) { /* no-op */ });
}
Calling a function taking a unique pointer from Python invalidates the passed Python object. nanobind will refuse further use of it:
Python 3.11.1 (main, Dec 23 2022, 09:28:24) [Clang 14.0.0 (clang-1400.0.29.202)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> import my_ext
>>> x = my_ext.create()
>>> my_ext.consume(x)
>>> my_ext.consume(x)
<stdin>:1: RuntimeWarning: nanobind: attempted to access an uninitialized instance of type 'my_ext.Data'!
TypeError: consume(): incompatible function arguments. The following argument types are supported:
1. consume(arg: my_ext.Data, /) -> None
Invoked with types: my_ext.Data
We strongly recommend that you replace all use of std::unique_ptr<T>
by
std::unique_ptr<T, nb::deleter<T>>
in your code. Without the latter type
declaration, which references a custom nanobind-provided deleter
:cpp:class:`nb::deleter\<T\> <deleter>`, nanobind cannot transfer ownership of
objects constructed using :cpp:class:`nb::init\<...\> <init>` to C++ and will
refuse to do so with an error message. Further detail on this special case can
be found in the advanced :ref:`section <unique_ptr_adv>` on object ownership.
In a shared ownership model, an object can have multiple owners that each register their claim by holding a reference. The system keeps track of the total number of references and destroys the object once the count reaches zero. Passing such an object in a function call shares ownership between the caller and callee. nanobind makes this behavior seamless so that everything works regardless of whether caller/callee are written in C++ or Python.
STL shared pointers (std::shared_ptr<T>
) allocate a separate control block to
keep track of the reference count, which makes them very general but also slightly
less efficient than other alternatives.
nanobind's support for shared pointers requires an extra include directive:
#include <nanobind/stl/shared_ptr.h>
Note
While this this header file technically contains a :ref:`type caster <type_casters>`, it is not affected by their usual limitations (mandatory copy/conversion, inability to mutate function arguments).
You don't need to specify a return value policy annotation when a function returns a shared pointer.
nanobind's implementation of std::shared_ptr
support typically
allocates a new shared_ptr
control block each time a Python object
must be converted to std::shared_ptr<T>
. The new shared_ptr
"owns" a reference to the Python object, and its deleter drops that
reference. This has the advantage that the Python portion of the
object will be kept alive by its C++-side references (which is
important when implementing C++ virtual methods in Python), but it can
be inefficient when passing the same object back and forth between
Python and C++ many times, and it means that the use_count()
method of std::shared_ptr
will return a value that does not
capture all uses. Some of these problems can be mitigated by modifying
T
so that it inherits from std::enable_shared_from_this<T>
.
See the :ref:`advanced section <shared_ptr_adv>` on object ownership
for more details on the implementation.
nanobind has limited support for objects that inherit from
std::enable_shared_from_this<T>
to allow safe conversion of raw
pointers to shared pointers. The safest way to deal with these objects
is to always use std::make_shared<T>(...)
when constructing them in C++,
and always pass them across the Python/C++ boundary wrapped in an explicit
std::shared_ptr<T>
. If you do this, then there shouldn't be any
surprises. If you will be passing raw T*
pointers around, then
read the :ref:`advanced section on object ownership <enable_shared_from_this>`
for additional caveats.
Intrusive reference counting is the most flexible and efficient way of handling shared ownership. The main downside is that you must adapt the base class of your object hierarchy to the needs of nanobind.
The core idea is to define base class (e.g. Object
) common to all bound
types requiring shared ownership. That class contains a builtin atomic counter
(e.g., m_ref_count
) and a Python object pointer (e.g., m_py_object
).
class Object {
...
private:
mutable std::atomic<size_t> m_ref_count { 0 };
PyObject *m_py_object = nullptr;
};
The core idea is that such Object
instances can either be managed by C++ or
Python. In the former case, the m_ref_count
field keeps track of the number
of outstanding references. In the latter case, reference counting is handled by
Python, and the m_ref_count
field remains unused.
This is actually little wasteful---nanobind therefore ships with a more
efficient reference counter sample implementation that supports both use cases
while requiring only sizeof(void*)
bytes of storage:
#include <nanobind/intrusive/counter.h>
class Object {
...
private:
intrusive_counter m_ref_count;
};
Please read the dedicated :ref:`section on intrusive reference counting <intrusive>` for more details on how to set this up.