Subash Matu
Tagline:Currently Hooked: Severance | Avid Youtube Binger
Hey!
I’m a data scientist and machine learning engineer with a deep interest in building intelligent systems that actually work in the real world. From neural signal modeling to ensemble LLMs, I enjoy solving complex problems with elegant, scalable solutions. I’m currently pursuing my Master’s in Data Science at Brown University, where I’m diving deep into everything from natural language processing to AI fairness.
At the core of my work is a belief that AI, as a tool, holds the promise to enhance our lives and open doors to ideas and discoveries still unseen. It’s here to stay and by embracing it thoughtfully, we have a chance to shape a more creative and connected future. This space is a little window into what I’m building, learning, and exploring.
Thanks for stopping by!
P.s. My Personal story all the way at the bottom
Ongoing Project and Ideas
DJ Bruno
date: 2025Organization:Ongoing Personal Project
Description:DJ Bruno is a real-time, fully-local personal DJ that curates music by understanding your emotional and situational context through multimodal sensing. It integrates facial expressions, voice input (like humming or singing), screen activity, and location cues to detect what you’re doing and how you’re feeling. These inputs power a central system that pulls from a vector database to build and continuously update a dynamic song queue. Bruno learns your preferences by observing patterns…such as when, where, and how you respond to certain songs, assigning emotional and contextual markers without the need for fine-tuning. A parallel process monitors for shifts in mood or activity, instantly refreshing the queue using song metadata. Users can also give feedback or make specific requests, allowing the system to adapt both in the moment and over time, creating a deeply personalized and responsive listening experience, all while keeping everything private and on-device.
Bruno Feels your Vibe before you do!
P.s Brown Bruno Bear be proud
DeepFake Detection Using FACTOR
date: 2025Organization:Ongoing Deep Learning Final Project
Description:Our project implements a deepfake detection model based on a paper that targets zero-day deepfakes, ones the system hasn’t seen before. Unlike traditional supervised classifiers, this approach detects inconsistencies between a deepfake and its associated claims (e.g., captions, identity, or audio-visual alignment) to flag manipulations, addressing the growing threat of misinformation and reputational harm.
Personal Projects
CodePeek
date: 2024Description:CodePeek is a VS Code extension that generates a "Context File" listing your workspace structure and selected file contents and captures terminal output into a "TerminalContext File." It’s designed for sharing your code/project context with LLMs (like DeepSeekR1, ChatGPT, etc) or team members, making debugging, code reviews, and AI integration seamless. No complex setup…just click and create!
Fascinations and Research Interests
- Polysemanticity
- Mechanistic Interpretability
- Recommendation Systems and Wisdom of Crowd Effects
Teaching Experiences
Teaching Assistant for CS1491 : "Fairness in Automated Decision Making"
from: 2025, until: presentOrganization:Brown UniversityLocation:CSCI
Head Teaching Assistant for Data1030: "Hands-on Data Science"
from: 2024, until: 2024Organization:Brown UniversityLocation:DSI
Teaching Assistant for Text Analysis
from: 2024, until: 2024Organization:Brown UniversityLocation:Watson institute
Research Experiences
Research Assistant
from: 2025, until: presentOrganization:Brown UniversityLocation:Andras Zsom
Description:Engineered a data pipeline to ingest over 100,000 betting records from Polymarket and the Polygon Blockchain, capturing individual-level betting behaviors for the 2024 USA election and analyze cross-market betting behaviors and patterns. Applied a Bayesian cascade model (inspired by the monotone likelihood ratio property) to quantify herd behavior and the “wisdom-of-crowd” effects, modeling each bettor’s decision as a signal that updates the posterior probability of adoption. Uncovered a higher correlation between early adopters’ signals and eventual market trends, highlighting fragile cascades and conditions under which later bettors deviate from early signals.
Machine Learning Researcher
from: 2024, until: 2025Organization:LifeSpan
Description:Developed a subject-specific SEEG signal reconstruction pipeline by fine-tuning a pre-trained CAE-LSTM model, leveraging advanced preprocessing and optimizing functional connectivity analysis (Granger causality, PLV) to model neural interactions, improving reconstruction accuracy and achieving a Pearson correlation of 0.9. Engineered a robust evaluation framework for SEEG signal reconstruction, integrating statistical similarity metrics (Hurst exponent: 0.72, Mutual Information: 1.34) and dynamic visualizations. Enabled deeper insights into brain signal reconstruction, improving interpretability for neurological research and clinical applications.
Industry Experiences
Machine Learning Summer Intern
from: 2024, until: 2024Organization:Citizens BankLocation:Jhonston RI
Description:Developed a hybrid ML model combining EfficientNet and triplet networks to detect signature forgeries with 87% accuracy. Leveraged advanced preprocessing and image augmentation techniques to enhance model robustness. Presented model insights to stakeholders through visualizations, explaining fraud detection performance and business impact.
Cloud Engineer Intern
from: 2022, until: 2022Organization:Disney - IT Globe
Description:Managed critical Network Operations Center functions for Disney using AWS for scalable cloud solutions. Utilized ServiceNow for efficient incident tracking and Zenoss for robust system monitoring. Engineered and maintained backend dashboards with Grafana and Pingdom for real-time performance tracking.
Software Developer Intern
from: 2021, until: 2021Organization:Australian Applied Engineering SystemsLocation:Austrlia (Online)
Description:Led and built a Website Development team to create and deploy a full-stack website, integrating a data analytics module that plotted user interaction data and refined inventory management, significantly boosting efficiency
Web Developer Intern
from: 2020, until: 2021Organization:Best Roadways Limited
Description:I spearheaded the design and development of a fully functional website, incorporating features like shipment tracking, dynamic fluid animations, and contemporary design principles. Additionally, I implemented a user-friendly CMS portal for seamless content management and collaborated closely with cross-functional teams to integrate diverse functionalities.
Certifications
Introduction to Data Science in Python
Issue date: Feb 2023,
Issued by: University of Michigan | Coursera .
Data Analytics with Python
Issue date: May 2022,
Issued by: IIT Roorkee .
The Joy of Computing using Python
Issue date: Nov 2021,
Issued by: IIT Madras .
Personal Story
I grew up in Detroit, Michigan, always drawn to building things with my hands and figuring out how stuff works. In 5th grade, I started learning CATIA and built little mockups of Beyblades using magnets, just because I was curious about how to make them spin and move the way I saw in the anime. Around that same time, I started making websites, simple ones at first, but I was fascinated by how writing a few lines of code could turn into something real on the screen.
Later, I moved to India and lived across different states, which helped me pick up a mix of languages and perspectives. Somewhere during that time, my family was working on building a home, and I used SketchUp to design a detailed mockup of it, something I’m still proud of, especially since it actually got built.
I went on to study computer science at SRM, mostly because I’ve always loved the feeling of making ideas real. During the pandemic, I built a live COVID tracker for my town in India, combining API integration, Vue.js for the frontend, and GitHub for data. I also created a system to help students on campus recover lost items, using a pretty scrappy image recognition model but effective enough approach that it was actually tested and used.
Through all of it, I’ve just followed my curiosity... learning whatever skill I needed to bring an idea to life. Whether it’s something small or something a bit more ambitious, the goal’s always been the same: making something real out of a thought and making it useful.