Long before most of us were thinking about large language models, DataCebo co-founders Kalyan Veeramachaneni and Neha Patki were creating an open source library called Synthetic Data Vault, or SDV for short. The company’s roots go back to 2016 when both were working in the MIT Data to AI Lab. For companies, which need to use quality business data in large language models (and for other purposes) but who can’t necessarily use PII to do it, this is an intriguing idea.