Soorya Prasanna Ravichandran

Software Engineer

Dallas, TX

SPR

About 🙋🏻‍♂️

Highly skilled software developer with 10 years of experience in designing, developing, leading and maintaining data-intensive distributed systems and enterprise applications.

Experience 👨🏻‍💻

Vice President → Core Data Engineering

Jan 2021 - Present

  • Data Lake → Lead for the firm's enterprise Data Lake platform, overseeing core services (ingest, sandbox, and simulator) that support reporting and analytics across ~160K unique datasets and ~180K daily data pipelines.
  • Spearheaded the data & operational catalog, and built the ingest application using HBase, Flink, HDFS, and Hadoop YARN.
  • Streamlined the SDLC and reduced time to production by 75% by adopting a blue/green deployment strategy.
  • Lakehouse → Lead Architect for Lake to Lakehouse migration, designing and implementing ingest migration architecture to enhance performance and reduce costs.
  • Engineered parallel ingest components, enabling dual ingest pipelines for seamless migration, enhancing scalability, and creating a failover environment for greater reliability.
  • Designed and implemented workflows to decommission 25% of Lake refiners & datastores, migrating unused data to cold storage and cutting quarterly storage costs by ~$500K.
  • Collaborated with Snowflake Engineers to develop the Snowpark Migration Accelerator, streamlining refiner categorization and auto-migration to Lakehouse.
  • Managed and mentored junior and mid-level developers, providing regular coaching through 1:1 sessions and career development planning.

Associate Software Engineer

Jan 2020 - Dec 2020

  • Architected and developed key components of Data Lake, including AWS S3 Ingest service, Data Partitioner, Snowflake Ingest service, Avro-Parquet Scanner & Converter utility, ensuring high reliability and on-time delivery.
  • Built a Simulator tool that enabled early bug detection, fostered developer mindfulness, and improved performance tuning, resulting in an 80% increase in customer satisfaction.

Analyst Software Engineer

Nov 2017 - Dec 2019

  • Contributed to the core backend metadata services for data cataloging, ingest, and refinement on the early-stage enterprise Data Lake platform.
  • Migrated ingest framework from legacy MapReduce to Apache Flink, reducing data availability latency by 25% and optimizing memory usage by 33%.

Curriculum Developer

Jun 2017 - Oct 2017

  • Led the Big Data Club as primary instructor, guiding students through hands-on projects with big data tools and technologies to build practical coding skills and understanding.
  • Developed and optimized course content, exercises, and learning pathways in collaboration with the Center for CS Education & Outreach to improve student engagement and mastery of big data concepts.

Software Engineer

Sep 2013 - Jul 2015

  • Achieved 100% predictability by building an Emergency Alert Solution to warn communities under fire and flood danger in Australia.
  • Developed, optimized, and delivered processed extracts on a Data Warehouse ETL platform → Network Decision Support Database for Telstra Corporation, Australia.

Software Engineer Intern

Feb 2013 - May 2013

  • Developed an online shopping cart that markets seasonal clothes to customers based on their geolocation.
  • Rapidly prototyped new capabilities to ensure feasibility of the project.

Education 👨🏻‍🎓

    The University of Texas at Dallas

    2015 - 2017
    Master of Science in Computer Science · Data Science Major

    Anna University, India

    2009 - 2013
    Bachelor of Engineering in Computer Science

Volunteering

    Mentor → Goldman Sachs Emerging Leaders Series (ELS)

    2024 - 2025

    Mentored sophomore students to cultivate essential industry skills that will help them become better engineers and explore opportunities at Goldman Sachs.

    Mentor → Goldman Sachs BAI/HLAI Network

    2021 - 2025

    Mentored interns, junior and mid-level engineers through the Black Analyst Initiative and Hispanic/Latinx Analyst Initiative to promote diversity within Goldman Sachs.

    GS Campus Recruiting Coordinator · North Americas

    2019 - 2025

    Led the recruiting team to identify, source, and hire 100+ engineers across the U.S. and Canada, ensuring alignment with business needs for the Goldman Sachs Engineering Division.

    UT Dallas Scouting Team

    2020 - 2025

    Led Goldman Sachs engineers in organizing hackathons and other engineering events at The University of Texas at Dallas.

    Big Data Instructor · UT Dallas

    2016 - 2017

    Developed competency among 500+ students and 300+ industry professionals by conducting hands-on workshops and seminars on Big Data technologies.

    CS Outreach Coordinator

    Summer 2016

    Designed and facilitated programming workshops for middle and high school students to foster early interest in STEM and encourage informed career choices through exposure to programming concepts. Also guided college students in strengthening their programming skills, supported their coursework and projects, and helped them secure internships to improve their job prospects and industry readiness.

Programming 🥷🏻

    Java
    TypeScript
    JavaScript
    SQL
    Python
    C++

Technologies 🛠️

    AWS
    Flink
    HDFS
    HBase
    Iceberg
    Kafka
    Snowflake
    Apache Avro
    Apache Parquet
    MapReduce
    Spark
    Hive
    REST
    JUnit
    CI/CD
    Git
    Maven
    Gradle
    NoSQL
    MongoDB
    Bash Shell Script
    Pandas
    SAP Sybase IQ

Projects 🎓

    Yelp Challenge

    Prediction of Yelp Review Star Rating → Sentiment Analysis & Recommendation Model.

    Java
    Scala
    PigLatin
    Big Data

    Shopping Website

    Developed a scalable web application to market a wide range of tech products.

    JavaScript
    PHP
    CSS
    MySQL

    Kaggle Challenge

    Implemented Naive Bayes, SVM, and KNN classifiers on a complex multivariate dataset, reaching 75% accuracy.

    R Programming
    Machine Learning

    Tweets Clustering

    Applied clustering algorithms to group tweets in a Twitter dataset.

    Java
    Machine Learning

    Car Rental System

    Designed a complete database for a car rental system, implemented in PL/SQL.

    PL/SQL
    Design

    Game of Craps

    Coded the casino game 'Craps' and applied machine learning strategies to automate wagering and maximize profit.

    Python
    Machine Learning

    Critical Paths

    Implemented the critical path method (CPM) algorithm on PERT charts to schedule a set of project activities.

    Java
    Algorithms

    Indexed Heap

    Implemented Indexed Priority Queues, Prim's Minimum Spanning Tree Algorithm, and Dijkstra's Shortest Path Algorithm, along with a performance comparison.

    Java
    Algorithms

    Skip List

    Implemented the Skip List data structure and compared its performance with the TreeMap data structure.

    Java
    Algorithms

    Euler Tour

    Implemented Hierholzer's algorithm to find an Euler tour for any given Eulerian graph.

    Java
    Algorithms

    Taggr

    Developed an application that processes a repository of videos and converts them into searchable hashtags using Optical Character Recognition (OCR).

    Java
    Cloud Computing

    Portfolio

    Personal webpage built using Next.js, deployed on Vercel.

    Next.js
    React