I'm Akshay, a Software Engineer on the Big Data Platform at TikTok USDS JV, where I own and scale Spark, ClickHouse, and Doris clusters that power analytics over 200+ petabytes. By night, I'm an MS CS student at Georgia Tech specializing in AI. Previously, I worked at Smucker's, Cognizant, and Scale AI.
Operating one of the largest data platforms on the planet — and chipping away at an MS CS in AI on the side.
Software Engineer · Big Data Platform
Owning, optimizing, and scaling the data infrastructure that powers TikTok's U.S. analytics — Spark, ClickHouse, and Doris clusters at ~200 PB scale. Same team I interned on; now here full-time.
Five years of building — from data labeling pipelines and generative-AI tooling to enterprise platforms and big-data infrastructure.
Bits and pieces from late nights, weekends, and side quests — tools, experiments, and study companions.
Poker odds calculator and range-prediction tool with a lightweight UI for live decision-making.
Pulls clean, structured text out of PDFs and images using OCR pipelines.
A long-running collection of my LeetCode, HackerRank, and competitive-programming solutions.
Notebook-style deep dive into Pandas and the foundations of practical data science in Python.
From-scratch implementations of classic algorithms — sorts, graph traversals, DP, and more.
Pre-trained CNN, fine-tuned on a custom dataset, for classifying dog breeds from photos.
A hand-rolled best-fit memory allocator written entirely in C — explicit free lists, coalescing, the works.
Sudoku validity checker in C. Tight, fast, and a great exercise in array indexing.
The portfolio you're reading right now — hand-built with vanilla HTML / CSS / JS, no framework.
Engineer who got hooked on big-data systems and never quite looked back.
In January 2026, I started full-time at TikTok USDS JV on the Big Data Platform team — the same team I interned on in 2025, now under the U.S. data-security joint venture spun out alongside Oracle, Silver Lake, and MGX. I'm based in San Jose, in the heart of the Bay Area.
I graduated from the University of Wisconsin–Madison in December 2025 with degrees in Computer Sciences, Data Science, and Mathematics. I'm now also a part-time MS student at Georgia Tech in Computer Science, specializing in Artificial Intelligence.
Day-to-day, I work on Spark, ClickHouse, and Doris clusters — owning, scaling, and optimizing them so analytics workloads over 200+ petabytes stay fast, cheap, and reliable.
Before TikTok, I shipped serverless alerting tooling and fine-tuned generative models at The J.M. Smucker Co., built a generative-AI Python package at Cognizant, and trained LLMs at Scale AI.
Outside of work, I play volleyball and golf, and I'm currently training for a full marathon.
Software Engineer @ TikTok USDS JV · Big Data Platform · San Jose, CA.
MS CS (AI) @ Georgia Tech · BS @ UW–Madison '25 (CS, Data Science, Math).
Big-data platforms, query engines, distributed storage, AI systems.
Volleyball, golf, and training for a full marathon.