Summary
Overview
Work History
Education
Skills
Websites
Languages
Certification
Research Patents
Timeline
Generic

Reshinth Adithyan

London,London

Summary

Machine Learning Research Scientist with research interests directed towards pre-training, alignment in CodeLMs/LMs with relevant experience in engineering towards high scale performance distributed training in GPUs(Pytorch) and TPUs(Jax)

Overview

5
5
years of professional experience
1
1
Certification

Work History

Research Scientist

Stability AI
London
12.2022 - Current
  • Led pre-training of StableCode model on a large GPU HPC cluster, which achieved SOTA results. Refer [https://huggingface.co/collections/stabilityai/stable-code-64f9dfb4ebc8a1be0a3f7650]
  • Have worked on Pre-training of internal 22B Code Model in TPUs with JAX.
  • Large scale data processing and cleaning of pre-training data with pyspark of volumes of text data in TBs.
  • Conducted various ablation studies to study nextgen CodeLMs and scaling laws abiding to increase the Reasoning capabilities of CodeLMs.
  • Developed methods to increase the reasoning and coding capabilities of base models.

Research Engineer

Saama Technologies
Chennai
10.2021 - 12.2023
  • Responsible for devising DSLs for Clinical Trial Transformation.
  • Responsible for research on DSL-based Neural Program Synthesis
  • Built a Grounded Neural Search in DSL’s Grammar.
  • Program Synthesis embedded in Python for Low Code No Code IDE.

Research Engineer

Tata Consultancy Services
Chennai
05.2019 - 10.2023
  • Responsible for devising, building unsupervised techniques to represent various forms of source codes such as DFG, AST, CFG.
  • Responsible for devising methods to compile code using Deep Learning without the use of an explicit Compiler.
  • Primarily used, Graph Neural Networks, Transformer based Architectures.
  • Expression Evaluation in Conditional Statements for Test Case Generation using Data Flow Graph.

Product - TransformPlus

Research Intern

Tata Consultancy Services
Chennai
01.2023 - 04.2023
  • Building Graph Neural Network-based Models to represent Naturalness in a Data Flow Graph of a System.
  • Naturalness via Hardcoded Values and Variable Names.

Education

Bachelor of Technology - Mechanical Engineering

Sri Manakula Vinayagar Engineering College
Pondicherry, India
04.2019

Skills

  • Large Scale GPU/TPU LM/CodeLM pretraining in large HPC
  • Working on designing large scale training runs and conducting large scaling studies
  • Maintained large scale neural network training library
  • Working on adapting CodeLMs to production level system
  • Pytorch/Jax with experience in GPUs and TPUs

Languages

Tamil
First Language
English
Proficient (C2)
C2
French
Upper Intermediate (B2)
B2

Certification

  • Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization -https://www.coursera.org/account/accomplishments/verify/BRJEEMQ4QWXX
  • Structuring Machine Learning Projects - https://www.coursera.org/account/accomplishments/verify/DDKM8RSHJ5QP

Research Patents

  • Method and system for translation of codes based on semantic similarity [https://patents.google.com/patent/US20230034984A1/en?q=(Reshinth)&oq=Reshinth]
  • Method and system for extracting natural language elements embedded in application source code[https://patents.google.com/patent/US11853710B2/en?q=(Reshinth)&oq=Reshinth]
  • Method and system for inferencing logic out of an application source[https://patents.google.com/patent/US20220222069A1/en?q=(Reshinth)&oq=Reshinth]

Timeline

Research Intern

Tata Consultancy Services
01.2023 - 04.2023

Research Scientist

Stability AI
12.2022 - Current

Research Engineer

Saama Technologies
10.2021 - 12.2023

Research Engineer

Tata Consultancy Services
05.2019 - 10.2023

Bachelor of Technology - Mechanical Engineering

Sri Manakula Vinayagar Engineering College
Reshinth Adithyan