
Tensorfuse

Run serverless GPUs on your own cloud

Tensorfuse helps you run fast, scalable AI inference in your own AWS account. Run any model, use any inference server (vLLM, TensorRT, Dynamo), and scale your AI inference to thousands of users, all set up in under 60 minutes.

Just bring:
1. Your code and environment as a Dockerfile
2. Your AWS account with GPU capacity

We handle the rest: deploying, managing, and autoscaling your GPU containers on production-grade infrastructure.
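As an illustration, the Dockerfile you bring might wrap an off-the-shelf inference server. The sketch below uses the public vLLM OpenAI-compatible server image; the image tag, model name, and port are assumptions for illustration, not Tensorfuse requirements:

```dockerfile
# Minimal sketch of a GPU inference container (assumed setup, not an
# official Tensorfuse example). Model name and port are placeholders.
FROM vllm/vllm-openai:latest

# Serve an OpenAI-compatible API on port 8000.
EXPOSE 8000

ENTRYPOINT ["python3", "-m", "vllm.entrypoints.openai.api_server", \
            "--model", "meta-llama/Llama-3.1-8B-Instruct", \
            "--port", "8000"]
```

Any container that starts a long-running server process would fit the same pattern; the point is that the environment is fully described by the image, so the platform only needs to schedule and scale it.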
Active Founders
Samagra Sharma

Samagra Sharma, Founder

Samagra is the Co-Founder and CEO of Tensorfuse. He has deep expertise in deploying production machine learning systems from his work on multimodal content generation at Adobe Research and on ML systems for network telemetry at UCSB. He is a published AI researcher and holds a patent on multimodal content generation. Additionally, he authored the Java implementation of "AI: A Modern Approach," a widely used AI textbook taught at over 1,500 universities around the globe.
Agam Jain

Agam Jain, Founder

Agam is the Co-Founder and CPO of Tensorfuse. He previously worked as a Computer Vision researcher at Qualcomm, where he published a paper and obtained a patent in image upscaling.
Jobs at Tensorfuse
Bengaluru, KA, IN
₹2.5M - ₹4M INR
0.50% - 0.80%
1+ years
Tensorfuse
Founded: 2023
Batch: Winter 2024
Team Size: 2
Status: Active
Primary Partner: Tom Blomfield
Company Launches
🚀 Tensorfuse - Deploy and scale LLM pipelines on your own cloud

Tl;dr

Tensorfuse makes it easy to deploy and manage LLM pipelines on your own cloud. Simply connect your cloud to Tensorfuse, select your model, point to your data and click deploy. Tensorfuse will provision and manage the underlying infrastructure for you. Behind the scenes, we manage K8s + Ray clusters, enabling you to scale without LLMOps overhead.

About us

We are Agam and Samagra. We have experience deploying production machine learning systems at scale at Adobe and Qualcomm. You can see some of our work while using Adobe Scan. Additionally, Samagra authored the Java implementation of "AI: A Modern Approach," which is a widely used AI textbook in over 1,500 universities worldwide. Agam worked as a Computer Vision researcher at Qualcomm, where he published a paper and obtained a patent for image upscaling.

Problem

Companies in regulated industries must build LLM apps on their own cloud to maintain control over their data. However, managing and scaling LLM infrastructure is hard and requires LLMOps expertise. Companies face the following issues:

❌ Deployment complexities increase development time and operational overhead

❌ Auto-scaling requires sophisticated solutions and there are not enough LLMOps experts in the market

Solution

Tensorfuse provides a single API to manage your infra. Simply connect your cloud to Tensorfuse, select your model, point to your data and click deploy. Tensorfuse will provision and manage the underlying infrastructure for you.

One of our clients deployed a production-ready retriever in just 6 days, a process that would otherwise have required months of experimentation.

❇️ Ask

  1. If you are facing issues deploying LLM apps on your own infra, contact us at founders@tensorfuse.io.
  2. Follow us on LinkedIn and Twitter for more updates.