HomeCompaniesShofo

Common Crawl for Videos

Shofo builds large-scale social media training datasets by collecting, labeling, and enriching public content for pre-training and fine-tuning AI models. Our indexes continuously collect and update every video across hard to access social media platforms. Companies use Shofo to avoid building their own data collection and processing systems and spending months turning unstructured social content into model-ready data. We started with TikTok and are expanding across every major social platform.
Active Founders
Bryan Hong
Bryan Hong
Founder & CEO
Co-Founder & CEO @ Shofo | Berkeley Dropout
Andre Braga
Andre Braga
Founder & Head of AI
Co-Founder & Head of AI @ Shofo | UCSB BS Statistics & Data Science | Prev. MIT
Braiden Dishman
Braiden Dishman
Founder & COO
Co-Founder & COO @ Shofo | UCSB BA Economics | Prev. AWS
Alexzendor Misra
Alexzendor Misra
Founder & CTO
Co-Founder & CTO @ Shofo | UCSB Dropout | Prev. CEO @ Correkt (43k users)
Company Launches
Shofo - Common Crawl for Videos
See original launch post

TL;DR

Shofo builds complete pipelines that collect, segment, sanitize, and label videos from across social media to curate custom datasets for AI labs.

https://youtu.be/J8wFAjQun8Y

❌ The Problem

AI labs need massive video datasets, but high-quality, segmented video data is hard to access.

✅ Our Solution

We started by building the largest index of short-form videos. Then we run an end-to-end pipeline that sanitizes and applies object and activity detection, reasoning, and segmentation to produce custom training datasets for AI labs.

For example, if a lab needs 50k cooking videos featuring hand-object interactions, We query our index, run the results through our labeling pipeline, and deliver a clean, annotated dataset.

🙏 Our Ask

If you or anyone you know is working with computer vision, we'd love to chat. Reach out at founders@shofo.ai or fill out this form here.

📖 Our Story

Our team met while building a previous startup called Correkt. Correkt was an AI search engine focused on multimodal content and reached over 40k users before pivoting to become Shofo.

uploaded image

From left to right: Braiden (COO), Alex (CTO), Bryan (CEO), and Andre (Head of AI)

😼 Appendix

Website: https://www.shofo.ai

Hugging Face: https://huggingface.co/Shofo

Linkedin: https://www.linkedin.com/company/shofoai

X: https://x.com/shofoai

Shofo
Founded:2025
Batch:Winter 2026
Team Size:4
Status:
Active
Location:San Francisco
Primary Partner:Jared Friedman