AI Inference Platform Engineer

Login to Send Email

Description

FastAPI Postgres DigitalOcean Django MySQL JavaScript Python Redis/Valkey Kubernetes Grafana ONNX Docker PyTorch AWS

I'm a platform engineer with expertise in AI inference systems (stateless and RAG based), particularly the MLOps side of things. I've scaled GPU clusters to tens of thousands of requests per second in a k8s environment. I've founded a couple (failed) companies, including a humanoid robotics company and an AI Essay writing company with GPT-2, before OpenAI had even made the first commit to their API. I also have a lot of experience working with webservers like Django and FastAPI. I have some experience in management through mentoring a high school robotics team for a few years and the previously mentioned companies. I'm mainly looking for IC roles. As a sort of "showcase" for some of my technical skills, I recently built and open sourced an all-in-one batteries included AI deployment repository: https://github.com/Mockapapella/batteries-included-ai-deployment. In a similar vein, I set up a kubernetes cluster that you can check out here that involve things like DataDog, LoadBalancers, and more: https://github.com/Mockapapella/kubernetes-proxy-inference