Grove AI HAT for Edge Computing

Is it possible to upload a generative adversarial network (GAN) model, trained on a custom image set, and have it process video and audio data in real time? I would like to start with 720p video and 44 kHz audio, with a pipeline covering voice alteration, face detection, object recognition and finally image cropping. Does the SDK allow the use of custom-programmed models?
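
To give a sense of scale, here is a rough estimate of the raw input rates I have in mind. The frame rate (30 fps), pixel format (24-bit RGB) and audio format (16-bit mono) are my own assumptions for illustration, not fixed requirements, and the real numbers depend on the camera and microphone used.

```python
# Back-of-the-envelope raw data rates for the inputs described above.
# Assumptions (not requirements): 30 fps, 24-bit RGB, 16-bit mono audio.

WIDTH, HEIGHT = 1280, 720      # 720p frame size in pixels
BYTES_PER_PIXEL = 3            # assumed 24-bit RGB
FPS = 30                       # assumed frame rate

AUDIO_SAMPLE_RATE = 44_000     # 44 kHz, as stated in the question
AUDIO_BYTES_PER_SAMPLE = 2     # assumed 16-bit samples
AUDIO_CHANNELS = 1             # assumed mono

# Uncompressed video bytes per second: pixels per frame * bytes per pixel * fps.
video_bytes_per_sec = WIDTH * HEIGHT * BYTES_PER_PIXEL * FPS

# Uncompressed audio bytes per second: samples * bytes per sample * channels.
audio_bytes_per_sec = AUDIO_SAMPLE_RATE * AUDIO_BYTES_PER_SAMPLE * AUDIO_CHANNELS

# Time available to process one frame if the pipeline must keep up in real time.
frame_budget_ms = 1000 / FPS

print(f"video: {video_bytes_per_sec / 1e6:.1f} MB/s")
print(f"audio: {audio_bytes_per_sec / 1e3:.1f} kB/s")
print(f"per-frame budget: {frame_budget_ms:.1f} ms")
```

So under these assumptions every stage of the video pipeline combined would have to finish in roughly 33 ms per frame, which is the figure I would like to compare against what the board can do.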



Another application I would like to explore is semantic segmentation (i.e. separating a live video scene into objects, foreground, background, etc.).
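
To make clear what kind of output I mean, here is a toy sketch in plain Python. The label map is hard-coded and stands in for a model's per-pixel output; the class ids and the helper names `mask_for` and `bounding_box` are hypothetical and not taken from any SDK.

```python
# Toy illustration of semantic segmentation output: one class id per pixel.
# Class ids are hypothetical; a real model would define its own.
BACKGROUND, PERSON, OBJECT = 0, 1, 2

# 4x6 label map standing in for the per-pixel output of a segmentation model.
label_map = [
    [0, 0, 1, 1, 0, 0],
    [0, 1, 1, 1, 2, 0],
    [0, 1, 1, 1, 2, 2],
    [0, 0, 1, 0, 0, 0],
]

def mask_for(labels, class_id):
    """Return a binary mask with 1 where the pixel belongs to class_id."""
    return [[1 if px == class_id else 0 for px in row] for row in labels]

def bounding_box(mask):
    """Return (top, left, bottom, right) of the nonzero region, or None if empty."""
    coords = [(r, c) for r, row in enumerate(mask) for c, v in enumerate(row) if v]
    if not coords:
        return None
    rows = [r for r, _ in coords]
    cols = [c for _, c in coords]
    return (min(rows), min(cols), max(rows), max(cols))

person_mask = mask_for(label_map, PERSON)
person_box = bounding_box(person_mask)
object_box = bounding_box(mask_for(label_map, OBJECT))

print(person_box)
print(object_box)
```

The bounding box derived from a mask is what I would then feed into the image cropping step.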



How does the compute power of the Sipeed MAix compare to other deep learning hardware such as GPUs? Do you have a benchmark comparison, for example against an NVIDIA gaming GPU?