HomeCompaniessieve

sieve

AI + human review to solve data cleaning - accessible via API or Excel

sieve solves data cleaning for hedge funds and investment firms by delivering clean, reliable data wherever they already work - via APIs for devs and Excel for analysts. Currently, many firms rely on manual review steps where engineers literally receive emails containing data they need to review. sieve replaces that with an API that integrates directly into their existing data pipelines. Instead of raising for human review, they can send all the same information to our API and get clean, trustworthy data back. On top of our API, we offer an Excel integration that lets analysts pull data from any source (filings, pdfs, images - even ones in other languages!) into their models with with just a URL. Analysts and data scientists complain that AI alone falls short of their accuracy requirements. They end up needing to check AI outputs against source documents to confirm accuracy. We solve this last mile problem by integrating expert reviewers to catch and correct the issues that AI alone doesn't get right.
Active Founders
Nicole Lu
Nicole Lu

Nicole Lu, Founder

Nicole is co-founder and CEO of sieve, which is solving data cleaning for hedge funds and investment firms. She previously worked at Citadel across Equity Quant Research, Global Trading, and Data Science groups, was a management consultant at McKinsey, and developed cancer detection algorithms at the Broad Institute.
Savannah Tynan
Savannah Tynan

Savannah Tynan, Founder

Savannah is CTO & Co-founder of sieve! Previously she worked at Bain where she spent the majority of her time in the private equity group, and did an externship at a pre-seed start up which she helped grow from 0 revenue to $10M ARR during her 6 months with the company. Prior to that, she did applied ML research at MIT's Environmental Solutions Initiative and studied CS at MIT.
sieve
Founded:2025
Batch:Spring 2025
Team Size:2
Status:
Active
Location:San Francisco
Primary Partner:Tom Blomfield
Company Launches
sieve helps hedge funds extract and clean data from any source directly into their existing tools
See original launch post

The problem

Right now, the most sophisticated hedge funds pay over-qualified data engineers to spend hours a day on the under-levered task of data cleaning. It's crazy to anyone outside of the industry, but in finance even earnings dates (the date a company reports earnings) are considered a known hard problem.

https://youtu.be/V-OpzDRge3o

Our ask

We've solved this problem and our ask to the YC community is - connect us with anyone you know who works in the investing world and uses data (should be all of them). You can email us at founders@usesieve.com.

Who we are

After studying CS at MIT, I worked at Citadel and McKinsey, and Savannah worked at Bain. We've dealt with this problem first hand and were amazed that there weren't good solutions.

What we've built

sieve lets hedge funds replace manual review in their data pipelines with a simple API call, so their engineers and analysts can get back to more differentiated work. Behind the scenes, we use AI to find and retrieve the appropriate source documents and to extract the requested data. Each data point is reviewed by a team of expert reviewers before being returned to the client. This lets us achieve the level of accuracy hedge funds need, and that AI-only approaches aren't able to achieve.

We offer direct API access and access via other tools like Excel. Watch below to see how the excel integration works!

https://youtu.be/r6A1t_X2kZo

What the early results look like

  • sieve is able to replace manual data cleaning workflows:
    • accurately (100% match to data that was hand-collected internally)
    • more scalably (weeks of full-time work to review data became a passive background task for the data scientist)
    • more cheaply (easily 60-70% cheaper than existing options, even when that existing option is BPO outsourcing)

In our first week of YC, we were told that we need to be at least one of: better, faster, or cheaper.