Subash Matu

Tagline:Currently Hooked: Severance | Avid Youtube Binger

personal photo of Subash Matu

Hey!

I’m a data scientist and machine learning engineer with a deep interest in building intelligent systems that actually work in the real world. From neural signal modeling to ensemble LLMs, I enjoy solving complex problems with elegant, scalable solutions. I’m currently pursuing my Master’s in Data Science at Brown University, where I’m diving deep into everything from natural language processing to AI fairness.

At the core of my work is a belief that AI, as a tool, holds the promise to enhance our lives and open doors to ideas and discoveries still unseen. It’s here to stay and by embracing it thoughtfully, we have a chance to shape a more creative and connected future. This space is a little window into what I’m building, learning, and exploring.

Thanks for stopping by!

P.s. My Personal story all the way at the bottom

Education

  • Master's degree

    from: 2023, until: 2025

    Field of study:Data ScienceSchool:Brown University

    • brownlogo

Ongoing Project and Ideas

  • Confer

    date: 2025

    Organization:Ongoing Personal Project

    Description:

    Confer is an intelligent research assistant that simplifies academic papers by embedding LLM-powered helpers directly into the content, enabling intuitive understanding of complex concepts.

    • 7A50F5E2-40E6-402A-826F-5D720994E0F4
  • DJ Bruno

    date: 2025

    Organization:Ongoing Personal Project

    Description:

    DJ Bruno is a real-time, fully-local personal DJ that curates music by understanding your emotional and situational context through multimodal sensing. It integrates facial expressions, voice input (like humming or singing), screen activity, and location cues to detect what you’re doing and how you’re feeling. These inputs power a central system that pulls from a vector database to build and continuously update a dynamic song queue. Bruno learns your preferences by observing patterns…such as when, where, and how you respond to certain songs, assigning emotional and contextual markers without the need for fine-tuning. A parallel process monitors for shifts in mood or activity, instantly refreshing the queue using song metadata. Users can also give feedback or make specific requests, allowing the system to adapt both in the moment and over time, creating a deeply personalized and responsive listening experience, all while keeping everything private and on-device.

    Bruno Feels your Vibe before you do!

    P.s Brown Bruno Bear be proud

  • DeepFake Detection Using FACTOR

    date: 2025

    Organization:Ongoing Deep Learning Final Project

    Description:

    Our project implements a deepfake detection model based on a paper that targets zero-day deepfakes, ones the system hasn’t seen before. Unlike traditional supervised classifiers, this approach detects inconsistencies between a deepfake and its associated claims (e.g., captions, identity, or audio-visual alignment) to flag manipulations, addressing the growing threat of misinformation and reputational harm.

Personal Projects

  • Promptly

    date: 2025

    Description:

    An AI-powered tool that learns your unique speaking and writing style, suggesting words seamlessly. It continuously adapts, not just to your evolving language but also to the way you communicate with different people, just like you naturally do.

    • Promptly
  • AutoPilotAI

    date: 2025

    Description:

    AutoPilot AI is a system that lets computers control themselves using PyAutoGUI and OmniParser under the hood. It automates tasks, navigates interfaces, and executes actions just like an AI-powered autopilot for your machine.

    • AutoPilotAI
  • LupoAI

    date: 2025

    Description:

    A multi-LLM ensemble framework designed to minimize hallucinations and boost factual accuracy by tapping into majority consensus across models. Built for reliability, this system enhances output trustworthiness through smarter model coordination.

    • Lupo
  • CodePeek

    date: 2024

    Description:

    CodePeek is a VS Code extension that generates a "Context File" listing your workspace structure and selected file contents and captures terminal output into a "TerminalContext File." It’s designed for sharing your code/project context with LLMs (like DeepSeekR1, ChatGPT, etc) or team members, making debugging, code reviews, and AI integration seamless. No complex setup…just click and create!

    • icon

Fascinations and Research Interests

  • Polysemanticity
  • Mechanistic Interpretability
  • Recommendation Systems and Wisdom of Crowd Effects

Teaching Experiences

  • Teaching Assistant for CS1491 : "Fairness in Automated Decision Making"

    from: 2025, until: present

    Organization:Brown UniversityLocation:CSCI

  • Head Teaching Assistant for Data1030: "Hands-on Data Science"

    from: 2024, until: 2024

    Organization:Brown UniversityLocation:DSI

  • Teaching Assistant for Text Analysis

    from: 2024, until: 2024

    Organization:Brown UniversityLocation:Watson institute

Research Experiences

  • Research Assistant

    from: 2025, until: present

    Organization:Brown UniversityLocation:Andras Zsom

    Description:

    Engineered a data pipeline to ingest over 100,000 betting records from Polymarket and the Polygon Blockchain, capturing individual-level betting behaviors for the 2024 USA election and analyze cross-market betting behaviors and patterns. Applied a Bayesian cascade model (inspired by the monotone likelihood ratio property) to quantify herd behavior and the “wisdom-of-crowd” effects, modeling each bettor’s decision as a signal that updates the posterior probability of adoption. Uncovered a higher correlation between early adopters’ signals and eventual market trends, highlighting fragile cascades and conditions under which later bettors deviate from early signals.

  • Machine Learning Researcher

    from: 2024, until: 2025

    Organization:LifeSpan

    Description:

    Developed a subject-specific SEEG signal reconstruction pipeline by fine-tuning a pre-trained CAE-LSTM model, leveraging advanced preprocessing and optimizing functional connectivity analysis (Granger causality, PLV) to model neural interactions, improving reconstruction accuracy and achieving a Pearson correlation of 0.9. Engineered a robust evaluation framework for SEEG signal reconstruction, integrating statistical similarity metrics (Hurst exponent: 0.72, Mutual Information: 1.34) and dynamic visualizations. Enabled deeper insights into brain signal reconstruction, improving interpretability for neurological research and clinical applications.

Industry Experiences

  • Machine Learning Summer Intern

    from: 2024, until: 2024

    Organization:Citizens BankLocation:Jhonston RI

    Description:

    Developed a hybrid ML model combining EfficientNet and triplet networks to detect signature forgeries with 87% accuracy. Leveraged advanced preprocessing and image augmentation techniques to enhance model robustness. Presented model insights to stakeholders through visualizations, explaining fraud detection performance and business impact.

  • Cloud Engineer Intern

    from: 2022, until: 2022

    Organization:Disney - IT Globe

    Description:

    Managed critical Network Operations Center functions for Disney using AWS for scalable cloud solutions. Utilized ServiceNow for efficient incident tracking and Zenoss for robust system monitoring. Engineered and maintained backend dashboards with Grafana and Pingdom for real-time performance tracking.

  • Software Developer Intern

    from: 2021, until: 2021

    Organization:Australian Applied Engineering SystemsLocation:Austrlia (Online)

    Description:

    Led and built a Website Development team to create and deploy a full-stack website, integrating a data analytics module that plotted user interaction data and refined inventory management, significantly boosting efficiency

  • Web Developer Intern

    from: 2020, until: 2021

    Organization:Best Roadways Limited

    Description:

    I spearheaded the design and development of a fully functional website, incorporating features like shipment tracking, dynamic fluid animations, and contemporary design principles. Additionally, I implemented a user-friendly CMS portal for seamless content management and collaborated closely with cross-functional teams to integrate diverse functionalities.

Certifications

  • Introduction to Data Science in Python

    Issue date: Feb 2023,

    Issued by: University of Michigan | Coursera .

  • Data Analytics with Python

    Issue date: May 2022,

    Issued by: IIT Roorkee .

  • The Joy of Computing using Python

    Issue date: Nov 2021,

    Issued by: IIT Madras .

Personal Story

I grew up in Detroit, Michigan, always drawn to building things with my hands and figuring out how stuff works. In 5th grade, I started learning CATIA and built little mockups of Beyblades using magnets, just because I was curious about how to make them spin and move the way I saw in the anime. Around that same time, I started making websites, simple ones at first, but I was fascinated by how writing a few lines of code could turn into something real on the screen.

Later, I moved to India and lived across different states, which helped me pick up a mix of languages and perspectives. Somewhere during that time, my family was working on building a home, and I used SketchUp to design a detailed mockup of it, something I’m still proud of, especially since it actually got built.

I went on to study computer science at SRM, mostly because I’ve always loved the feeling of making ideas real. During the pandemic, I built a live COVID tracker for my town in India, combining API integration, Vue.js for the frontend, and GitHub for data. I also created a system to help students on campus recover lost items, using a pretty scrappy image recognition model but effective enough approach that it was actually tested and used.

Through all of it, I’ve just followed my curiosity... learning whatever skill I needed to bring an idea to life. Whether it’s something small or something a bit more ambitious, the goal’s always been the same: making something real out of a thought and making it useful.