+1(480)9307074
Satish Bhambri
Senior Data Scientist | MLE | Computational Astrophysics
![]() | ![]() | ![]() |
---|---|---|
![]() |
INTELLIGENT FEEDBACK SYSTEM FOR COURSES
(RESEARCH PROJECT)
Based on the reflections provided for the Assignments, this system predicts the current standing/skill level and understanding of the subject matter by the student.
Approach :
• Uses Natural Language Processing to analyze the text data
• Latent Semantic Analysis
• Word2Vec, TF-IDF
• Exploratory Data Analysis and Feature Vector Generation
• Word Vectors generated to train the data model
• Predefined Keyword analysis for clustering
Spectral Clustering
• The cluster shape non pre-deterministic
• Laplacian Calculations
• Eigenvalue Matrix and Normalization
• Suitable Clustering Algorithm
• Cluster Analysis
RECOMMENDER SYSTEM FOR OPTIMAL TEAM FORMATION OF RESEARCHERS
(RESEARCH PROJECT)
Given a dataset of researchers who have worked on different areas of research our system selects researchers and recommends the most favourable group that can work together
- Used NLP techniques such as Doc2Vec and different clustering models of Hierarchical Clustering, Spectral Clustering and Minimum Sum graph based algorithm to determine the best possible match for the given problem
- Model evaluation is done based on the Node edge and Attribute feature vectors and their comprehensive
scores among different models
- Doc2Vec Ranking
- Hierarchical Clustering with Doc2Vec
- Spectral Clustering with Doc2Vec (with K Means after Eigen Value Decomposition)
- Minimum Sum - TF algorithm
INTELLIGENT MEDICARE DIAGNOSIS ENGINE
A self diagnosing medicare engine with expert analysis medical web application to provide healthcare to underprivileged using NLP, LDA, spectral clustering, Google Speech to Text API and APIMedic API.
CANCER PROGNOSIS AND PREDICTION
Developed prediction algorithm using Standard Scalar, PCA, Logistic Regression, Sklearn Pipeline and k fold cross validation, achieving the accuracy of 97%.
REAL ESTATE PRICE ESTIMATOR (PCA, STRATIFIED K FOLD CROSS VALIDATION, RMSE, STANDARD SCORES)
Deployed scatter matrix visualizing the pair-wise correlations, dimensionality reduction using PCA, feature selection using correlation matrix.
GEOSPATIAL DISTRIBUTED COMPUTING (HDFS, APACHE SPARK, SCALA, GEOSPARK)
Performed geospatial database operations on large datasets stored in distributed system, deploying cluster analysis using Ganglia.
Implemented Spatial-Temporal hotspot analysis algorithm, determining the top 50 hotspots for taxi pickups in NY using Getis-ordStatistics.
HAND-WRITTEN DIGITS RECOGNITION (NEURAL NETWORKS, MATLAB, PYTHON)
Implemented one-vs-all logistic regression, backpropagation for Neural networks.
Would you like to learn more about my Astrophysics and Quantum Computation research projects?