Multimodal AI built with Tar Heel research
Working with a team at Microsoft Research, Carolina computer science professor Mohit Bansal and his student Zineng Tang, a Microsoft intern, created the CoDi AI system, a model capable of generating any combination of outputs (e.g., text, images, videos, audio) from any combination of inputs. Microsoft Research featured the project on its website last summer, and a few months later, the team presented the revamped CoDi-2 to much fanfare.