Natural Language Processing
MASTERS DEGREE IN DATA SCIENCE AND ENGINEERING
University: Birla Institute of Technology and Science (BITS Pilani, India)
Graduated with Distinction.
Courses undertook: Data Mining, Machine learning, Deep Learning, NLP & Text Mining, Information Retrieval, Big Data Systems, Intro to Data Science, Mathematics for Data Science, Data Visualisation, Statistical Mathematics, Algorithmic Foundation for Data Science
Semester 1: 8.12/10,
Semester 2: 8.63/10,
Semester 3: 9.083/10,
Master's dissertation Topic: Mixed-mode Federated Learning Cloud Based Systems
UNDERGRADUATE BACHELOR'S DEGREE IN COMPUTER SCIENCE AND ENGINEERING
University: West Bengal University of Technology, India
Courses undertaken : All computer Science related courses
Rank: 4 out of 70
PROFESSIONAL WORK EXPERIENCE
SENIOR DATA SCIENTIST, WIPRO LIMITED
Building Search Engine for healthcare with medical documents using NLP & Text mining for Established AI Products using Python.
SENIOR DATA SCIENTIST, IRIS SOFTWARE INC.
Worked in healthcare, bioinformatics project to identify candidate driver genes in a cluster of gene regulatory network using Python.
SENIOR SYSTEMS ENGINEER, INFOSYS LIMITED
Worked on end to end Natural Language Processing Pipeline Techniques for text mining, & document similarities using Python.
Text Mining and NLP: Document Similarity, Language Modelling, Word2Vec, Machine Translation
Healthcare: Cancer Genomics, Bioinformatics, Gene Regulatory Network architecture
Programming Languages: Python, R, Objective-C, C, Java
Deep Learning: Neural Network, CNN, RNN, GAN, Transformer (Attention) architecture, Chatbot, Image classification,
Machine Learning: Regression, Classification, Clustering, Tree Based Algorithms, Bagging, Boosting
Data Science: Data Pipeline Modelling, Data Pre-processing, Data Cleaning, Data Visualization, IBM Watson WML
Tools and Libraries: Spacy, Pandas, Sklearn, PyTorch, Keras, Jupyter, Anaconda, Numpy, Seaborn, SQL, Cloud DevOps
Sequence Alignment (Language: Python) - Used Dynamic Programming in healthcare for genomic sequence alignment from the input data
Branch Correlation in Programming Paradigm (Language: Python) Skills used: Data-driven Programming.
Utilized ROSE compiler to instrument given program to produce dynamic outcome (T/F) for conditional branches
Tririga Building Assistant (IoT - TRIRIGA Assistant) : (Language: Python) - Worked on the Anomaly Detection model that runs in every 15 min gap from CISCO DNA sensor data, USA
Worked on a Guest KPI model that runs everyday to predict which floor has guest(s) using Python
ML algorithms: Classification, regression, clustering, ensemble methods, dimensionality reduction
Deep learning: Neural Net(NN), RNN,LSTM, GRU, optimization techniques
Python libraries: Numpy, Pandas, Sklearn
Computer Vision library: Pytesseract, CV2
Featurization techniques: feature extraction, transformation, Bag of Words, TFIDF, PCA(dimensionality reduction), and selection
Pipelines: Spark, MLLib library for constructing, evaluating, and tuning ML Pipelines on Spark
AI methodologies & NLP: Spacy & NLTK components
Research papers: LeNet-5, BERT, Word2Vec, BioBERT, Clinical BERT, Attention mechanism, Transformer
Presented ‘Attention Is All You Need’ Paper at Portland State University, USA
Participated in the RegML 2020 Conference, University of Genova, Italy
Completed Data Analytics Global Internship, organized by Taken Mind and United Nation, UK
AWARDS & ACHIEVEMENTS
Awarded for ICML 2021 with 100% registration fee scholarship to attend conference and workshop track
Got selected to attend the Google Developer’s program, Google India, 2019
Feel free to contact me for more information regarding my research, career experiences and for new opportunities.