I'm parsing a command string and creating an abstract syntax tree (AST) composed of nodes of different types. I'm trying to figure out an efficient way to execute the command.
This app is extremely performance sensitive. It's for a query engine that will need to process millions of items while the user waits.
The standard way to do this in a non-Rust language is to have each node type implement a common interface. To do this in Rust with a trait, though, you have to box the child nodes (Box<dyn Node>), and every call then goes through dynamic dispatch. For example:
trait Node {
    fn get(&mut self) -> u32;
}

struct MyParentNode {
    my_int: u32,
    left: Box<dyn Node>,
    right: Box<dyn Node>,
}

impl Node for MyParentNode {
    fn get(&mut self) -> u32 {
        self.left.get() + self.right.get()
    }
}

struct MyLeafNode {
    my_int: u32,
}

impl Node for MyLeafNode {
    fn get(&mut self) -> u32 {
        self.my_int
    }
}
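For concreteness, here is how such a tree of boxed nodes gets built and evaluated (the `build_tree` helper and the leaf values are mine, just for illustration); every child is behind a `Box<dyn Node>`, so each `get` call goes through the vtable:

```rust
trait Node {
    fn get(&mut self) -> u32;
}

struct MyParentNode {
    my_int: u32,
    left: Box<dyn Node>,
    right: Box<dyn Node>,
}

impl Node for MyParentNode {
    fn get(&mut self) -> u32 {
        self.left.get() + self.right.get()
    }
}

struct MyLeafNode {
    my_int: u32,
}

impl Node for MyLeafNode {
    fn get(&mut self) -> u32 {
        self.my_int
    }
}

// Build the tree (2 + 4). Each child is boxed, so the calls in
// MyParentNode::get are dynamically dispatched.
fn build_tree() -> Box<dyn Node> {
    Box::new(MyParentNode {
        my_int: 0,
        left: Box::new(MyLeafNode { my_int: 2 }),
        right: Box::new(MyLeafNode { my_int: 4 }),
    })
}

fn main() {
    let mut root = build_tree();
    println!("{}", root.get()); // 6
}
```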
I have seen benchmarks that suggest that dynamic dispatch is very slow compared to calling a concrete function.
One alternative, not much better, is to use enums as node types:
enum NodeEnum {
    Parent(Box<ParentInfo>),
    Leaf(Box<LeafInfo>),
}

impl NodeEnum {
    fn get(&mut self) -> u32 {
        match self {
            NodeEnum::Parent(parent) => parent.get(),
            NodeEnum::Leaf(leaf) => leaf.get(),
        }
    }
}

struct ParentInfo {
    left: NodeEnum,
    right: NodeEnum,
}

impl ParentInfo {
    fn get(&mut self) -> u32 {
        self.left.get() + self.right.get()
    }
}

struct LeafInfo {
    my_int: u32,
}

impl LeafInfo {
    fn get(&mut self) -> u32 {
        self.my_int
    }
}
(The code above may not look terrible, but the real app will have a couple dozen node types and a dozen methods on each, which means a hand-written match in every one of those methods.)
There has to be a better way. Is there any way to implement this without the overhead of dynamic dispatch or having to call match on every call to get()?
Can function pointers help?
Please find below a quick-and-dirty experiment around your question.
I'm aware that the way I did the timing (in release mode, though, and with help from cargo flamegraph) is not rigorous enough (dedicated tooling exists for that purpose); this just gives a coarse-grained feeling.

Two solutions are tested. The first one uses dynamic dispatch. The second one uses the enum_dispatch crate in order to replace the virtual table with (automatically generated) match statements. There is a substantial gain in performance (4.7 vs 2.0 seconds in this specific case and on my computer).

Note that if the nodes in your actual use case do much more than the simple additions of this example, the dispatch cost may not be significant. In the end, only timing your actual use case can tell whether all of this is beneficial.
N.B.: in my first answer, I included the creation of the nodes in the timings, because I didn't know the use case. Obviously, much of the time was spent allocating/freeing, and I tested a third version to mitigate that. On the other hand, if we consider that the nodes are built once and used many times (as in this new answer), then the dispatch cost becomes significant.