Foundry is building the future of AI infrastructure with our Cloud Platform, providing self-serve access to high-performance GPU compute for training, fine-tuning, and serving AI models. We’re simplifying infrastructure for dynamic AI workflows, enabling AI practitioners to focus on innovation, not infrastructure.
We’re well-funded ($80M, Series A), growing quickly, and looking for talented people to join our team.
Here are some of the roles we’re hiring for:
* Senior Software Engineer, Full Stack
Design and build our compute marketplace and products. Focus on both backend and frontend technologies, REST APIs, and microservice architecture. [6+ years experience with Typescript, Python, etc.]
* Site Reliability Engineer (SRE), Cloud
Build reliable systems for AI workflows. Work across Kubernetes, Linux, and cloud services to ensure platform scalability and performance. [Focus on system design and reliability.]
* Site Reliability Engineer (SRE), Supply
Work with our supply partners to build & maintain reliable systems for distributed compute. Work across Kubernetes, Linux, and cloud services to ensure reliability, scalability and performance. [Focus on customer interactions and reliability.]
* Software Engineer, Security Engineer
Design and implement security strategies for our AI/ML infrastructure. Build systems that keep our platform secure at scale.
Learn more about us and apply: www.mlfoundry.com/company Or email us directly: careers@mlfoundry.com
Stanford Research Computing (https://srcc.stanford.edu) is a collaboration between University IT and the Vice Provost and Dean of Research. We operate HPC environments for researchers, we do one-time consultations on projects (from software and pipelines, to data management, to physical building design and fit-out), and we provide contract support for individual Labs, Departments, and Schools.
We have two open positions:
• GPU Cluster Sysadmin: With Marlowe—our 1SU NVIDIA DGX H100 SuperPOD with DDN Intelliflash and DDN NFS storage—launched, we have decided to hire an additional sysadmin! You'll be working with the latest AI/ML/Deep Learning/LLM software & frameworks, getting them to work in an HPC environment. You'll be keeping the environment up-to-date, and working with NVIDIA/DDN when there's trouble. You should also expect to interact with users & PIs a lot. More info: http://phxc1b.rfer.us/STANFORD8mESql (or, if you're very experienced in the field, go to http://phxc1b.rfer.us/STANFORDrWNSqm).
• Research Computing Systems Engineer: We run lots of different compute environments. Some of them have to deal with HIPAA data, which presents unique challenges, and so we are hiring an additional sysadmin! You should already have experience with Linux system administration; as well as the necessary networking, storage, and configuration management skills needed to bring up a compute environment. Ideally you will also have experience with job schedulers (SLURM, LSF, etc.) and the peculiarities of working with the requirements for managing data under HIPAA. More info: http://phxc1b.rfer.us/STANFORD7u6TAI
If you don't already live in the Bay Area, we provide a relocation incentive. Depending on where you live, we provide free transit passes. Unfortunately, if you drive, you will have to pay for parking for the days you're on-site. There is some on-call around the holidays. We get a 403(b) match, good healthcare, and 30+ days off per year (holidays + vacation). All Benefits are all publicly documented at https://cardinalatwork.stanford.edu/benefits-rewards.
If you have questions, feel free to reply here or email me (the info is in my profile)!
We build web3 developer tools and are most focused on solving the "schlep" types of problems that every team ends up building in house but shouldn't have to.
Building stuff in crypto today is really hard. Imagine if you wanted to put up a personal website and all you had to do was start by breaking ground to build a data center. Your personal page probably would not be that good, would it? Small teams can only focus on so many things; we want them to use center so they can focus on the details that truly matter and make products beautiful.
We want to eliminate all of these annoying, cumbersome, expensive problems that devs face so that they can build incredible user experiences that help people flourish and rise to the level of their ambition, without having to think about if their nodes are online or if their indexer is missing any data.
We started about 3 years ago and have finally started to find product market fit. We're looking to scale the engineering team so that we can finally bring our products to market.
Our investors include Founders Fund, Thrive Capital, SV Angel, @balajis, and many more incredible folks who have taken a chance on us; it's been my mission for the last 3+ years to make sure that everyone involved in this venture does phenomenally well.
https://jobs.ashbyhq.com/center
Or email me directly: omar+hn@center.app