What is the difference between ClipAnything vs. ClipBasic? - Opus Clip

ClipAnything surpasses ClipBasic in all aspects, specifically:

	ClipAnything	ClipBasic
Video type supported	Extending beyond just talking-head videos, ClipAnything can clip any type of video, including but not limited to vlogs, sports, TV shows, behind-the-scenes, news, music, and videos with little to no dialogue.	Can only clip talking-head videos well.
User Prompt	Users can use natural language prompts (whether sentences or keywords) to find any scene, action, character, event, emotional moment, viral topic, and more. To learn more about prompts, please visit the ClipAnything Prompts Manual.	Users can only write prompts using keywords mentioned in the original video.
Visual understanding	Analyze and understand everything on screen, including but not limited to: Objects: Characters, animals, items Colors: Yellow flowers, a girl in a purple dress Actions: Patrick Mahomes throwing a touchdown, a host reviewing a product Text/Overlays: Scoreboards, name cards, logo	Limited capability that only understands speaker positions.
Audio understanding	Extending beyond understanding conversations, ClipAnything: Accurately identifies which speaker is talking. Understands a wide range of sounds, including but not limited to: Emotional Sounds: Laughter, cheering, shouting, arguing Environmental Sounds: Bird chirping Sound Effects: Glass shattering, car honking Others: Music and more	Relies solely on the transcript to understand spoken words.
Sentiment analysis	Powerful sentiment analysis that understands a range of emotions, such as happiness, joy, sorrow, surprise, sadness, couple arguing, and more.	No sentiment analysis.
Narratives	Narratives refer to the storylines or the structured progression of events that are conveyed through the video. ClipAnything has a rich narrative library in collaboration with award-winning producers, applying the best narratives to your clips based on your video genre.	No narratives.
Reasoning	ClipAnything excels in video scene analysis, seamlessly integrating text, voice, emotion, and visuals for comprehensive understanding. It performs deep, multimodal reasoning to infer meaning and intent from diverse data. ClipAnything not only understands in-scene content but also excels at cross-scene analysis, providing insights with remarkable depth and clarity.	No reasoning capabilities.