After a bit of digging and some help on GitHub (thanks LynnL4), I managed to figure out the communication protocols.
Essentially, all detection data (bounding boxes, classifications, etc.) comes through MQTT. There is an RTSP server, but the detection data is not present in the events channel.
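If anyone wants to see what is actually being published, a rough sketch like the one below should work. It assumes the paho-mqtt package is installed and that the broker is reachable without credentials. The host address and port are placeholders I made up, and I am not assuming any particular topic name or payload layout, so it just subscribes to everything and prints whatever arrives.

```python
import json


def decode_payload(raw: bytes):
    """Return parsed JSON if the payload is valid JSON, otherwise the decoded text."""
    text = raw.decode("utf-8", errors="replace")
    try:
        return json.loads(text)
    except json.JSONDecodeError:
        return text


def on_message(client, userdata, message):
    # Print every message so topic names and payload structure can be inspected.
    print(message.topic, decode_payload(message.payload))


def main(host="192.168.1.50", port=1883):
    # host and port are hypothetical placeholders; use your broker's address.
    # Imported here so decode_payload works even without paho-mqtt installed.
    import paho.mqtt.subscribe as subscribe

    # "#" is the MQTT wildcard for all topics; narrow it once the
    # detection topic is known. This call blocks and runs until interrupted.
    subscribe.callback(on_message, "#", hostname=host, port=port)
```

Call `main()` with the broker's address and watch the output while the device is detecting something. Once the detection topic shows up, replace `"#"` with that topic to cut the noise.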