Multimodal Scene Understanding System sync_timseries_and_prompt_AV.ipynb provides an example of fusing Audio Na Video data to help in a Caregiving facility An overview of the system's working