The current AI B-Roll seems to add just 2 media items near the beginning of the video. It's a cool concept but could be way better, especially for longer videos.
I suggest you make it work like this:
The AI reads the content and highlights the keywords/topic of each sentence or paragraph.
Then, based on the topic/keyword of each sentence, each paragraph, or every 3 sentences, it produces the B-roll, with an option to set a default clip length (e.g. 3 seconds).
That way you have B-roll all the way through the video and it will be ON POINT, not something random!
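To make the idea concrete, here is a rough Python sketch of the flow described above. Everything in it is an assumption for illustration, not how the product works today: the names `plan_broll`, `BrollCue`, and `top_keyword` are made up, the keyword pick is just a word-frequency count with a tiny stop-word list, and start times are estimated from an assumed speaking rate of 2.5 words per second rather than taken from real audio timing.

```python
import re
from collections import Counter
from dataclasses import dataclass

# Tiny illustrative stop-word list (assumption); a real tool would use a fuller one.
STOP_WORDS = {
    "a", "an", "the", "and", "or", "but", "of", "to", "in", "on", "for",
    "with", "is", "are", "was", "were", "it", "this", "that", "you", "your",
    "we", "our", "be", "as", "at", "by", "from", "they", "their", "can", "will",
}


@dataclass
class BrollCue:
    start: float     # estimated seconds into the video
    duration: float  # how long the B-roll clip stays on screen
    keyword: str     # search term to use for stock media


def split_sentences(script):
    """Split on whitespace that follows sentence-ending punctuation."""
    parts = re.split(r"(?<=[.!?])\s+", script.strip())
    return [p for p in parts if p]


def top_keyword(text):
    """Return the most frequent non-stop-word; ties go to the earliest word."""
    words = [
        w for w in re.findall(r"[a-z']+", text.lower())
        if w not in STOP_WORDS and len(w) > 2
    ]
    if not words:
        return ""
    counts = Counter(words)
    best = max(counts.values())
    return next(w for w in words if counts[w] == best)


def plan_broll(script, sentences_per_clip=3, clip_seconds=3.0, words_per_second=2.5):
    """Produce one B-roll cue per group of sentences across the whole script."""
    sentences = split_sentences(script)
    cues = []
    elapsed = 0.0
    for i in range(0, len(sentences), sentences_per_clip):
        chunk = " ".join(sentences[i:i + sentences_per_clip])
        chunk_seconds = len(chunk.split()) / words_per_second
        keyword = top_keyword(chunk)
        if keyword:
            cues.append(BrollCue(
                start=round(elapsed, 2),
                # Never let a clip run longer than the speech it covers.
                duration=min(clip_seconds, chunk_seconds),
                keyword=keyword,
            ))
        elapsed += chunk_seconds
    return cues
```

Calling `plan_broll(script)` gives one cue for every 3 sentences with a 3-second default, and `plan_broll(script, sentences_per_clip=1)` gives one per sentence. Each cue's `keyword` would then be handed to whatever stock-media search the app already uses.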
I pretty much do this manually with minivo and it's an easy and fast way to get B-roll. If we automate the process, OOOOF this will be KING!