Data Analytics | Machine Learning | Artificial Intelligence

Hello, I'm Aleena Varghese.

I am an early-career data professional with a strong foundation in M.Sc. Computer Science (Data Analytics). I specialize in transforming raw data into actionable insights and building practical AI/ML solutions. My focus involves cleaning complex datasets, building predictive models, and exploring Generative AI applications using Python and Cloud tools.

Aleena Varghese

Technical Skills

Programming & Data

Python Pandas NumPy SQL

Data Analytics

Data Cleaning EDA Probability & Stats Time Series Analysis

Machine Learning

Regression SVM Time Series forecasting (LSTM) CNNs OpenCV

NLP & GenAI

Text Preprocessing RAG LangChain Vector Databases

Cloud & Tools

Google Cloud Platform BigQuery GitHub Jupyter Streamlit Docker

Featured Projects

Time Series Forecasting with LSTM

TensorFlow Keras Scikit-learn

Problem Statement

Predicting future values in sequential data to assist in trend analysis.

Approach

Implemented an LSTM neural network utilizing windowing techniques for data preparation and normalization.

Outcome

Successfully compared the deep learning model against baseline statistical approaches, evaluating performance using RMSE and MAE metrics used for time-series validation.

Age, Gender & Emotion Prediction using CNN

Python OpenCV CNN

Problem Statement

Analyzing facial images to classify demographic attributes and emotional states.

Approach

Built a Convolutional Neural Network (CNN) with data augmentation to handle class imbalance. Integrated OpenCV for image processing.

Key Learnings

Gained deep understanding of bias in datasets and ethical considerations in facial recognition technology.

Sign Language Communication Assistant

OpenCV SVM Gemini AI

Problem Statement

Bridging the communication gap for sign language users through real-time recognition.

Approach

Collected gesture data to train an SVM classifier for static signs. Integrated Gemini API to convert recognized signs into natural text and audio.

Outcome

Created a hybrid system combining traditional ML (SVM) with modern GenAI capabilities for enhanced accessibility.

AI-Powered Gmail Chatbot (RAG)

LangChain ChromaDB LLM APIs

Problem Statement

Managing large volumes of email efficiently using natural language queries.

Approach

Built a Retrieval-Augmented Generation (RAG) pipeline. Cleaned Gmail data, created vector embeddings, and used LangChain for semantic search and response generation.

Outcome

Enabled rule-based categorization and context-aware responses, streamlining email management.

Education & Certifications

Education

M.Sc. Computer Science (Data Analytics)
Digital University Kerala, Thiruvananthapuram
B.Sc. Mathematics
Mar Ivanios College, Thiruvananthapuram

Certifications

Google Cloud — Data Analytics Track
Issued by Google Cloud
Google Cloud — Computing Foundations Track
Issued by Google Cloud
Deloitte — Data Analytics Job Simulation
Forage
Tata Group — Data Visualization
Forage
Python Data Structures
Coursera
Introduction to Large Language Models
Google Cloud

Get In Touch

I am currently open to opportunities in Data Analytics, Data Science, and AI/ML. Send me a message using the form below.

Or reach out directly via: