About Me
PK
I'm a Computer Science undergraduate with a deep passion for AI and Data Science. I have proficiency and hands-on experience with deep learning frameworks, LLM architectures, and data analytics pipelines.
Currently focused on building intelligent AI agents, RAG systems, and expanding my MLOps skillset.
Skills
C/C++PythonJavaScriptHTML/CSSSQLRAGGraphRAGGenAILLMsNLPAgentic WorkflowsStreamlitFlaskDockerLlamaIndexLangChainLangGraphNeo4jFastAPITensorFlowPyTorchPandasNumpyScikit-learnGitGitHubVS CodeGoogle ColabLinux
Experience
Data Analyst Intern
June 2025 - July 2025TrendalyTix
- Architected predictive models using Scikit-learn and Pandas to analyze real-world datasets, improving data-driven decision accuracy.
- Optimized automated data cleaning pipelines for production-grade analytics workflows.
- Synthesized complex technical findings into actionable business insights, presenting stakeholders with interactive visualizations.
PythonPandasScikit-learnData Visualization
Artificial Intelligence Trainer (Freelance)
Dec 2024 - Feb 2025Outlier
- Optimized Large Language Model (LLM) performance through Advanced Prompt Engineering and Chain-of-Thought reasoning across STEM and coding domains.
- Executed RLHF (Reinforcement Learning from Human Feedback) workflows to evaluate and refine model outputs, ensuring technical accuracy and ethical alignment.
Prompt EngineeringRLHFLLMsChain-of-Thought
Projects
Agentic Data Analyst
- Developed an autonomous Cognitive Agent using LangGraph that plans analysis steps, executes Python code, and autonomously resolves runtime errors via recursive reflection loops (Self-Healing).
- Deployed a secure, containerized application using Docker on Render with a Streamlit interface, implementing sandbox guardrails.
LangGraphLlama-3.3DockerStreamlitPython
Check Source CodeImage Captioning
- Designed a multimodal deep learning pipeline utilizing CNNs (Encoder) for spatial feature extraction and LSTMs (Decoder) for sequential natural language generation.
- Enhanced model accuracy on the Flickr8k dataset using Transfer Learning with pre-trained vision models.
PythonTensorFlow/KerasNLPCNNLSTM
Check Source CodeWeather Forecasting
- Developed a weather forecasting model to predict future conditions by analyzing historical time-series data.
- Built an end-to-end data pipeline using Machine Learning techniques in Python and Pandas.
PythonPandasScikit-learnData Analysis
Check Source CodeBioGraph-RAG
- Engineered a GraphRAG architecture combining Neo4j Knowledge Graphs with Vector Search to resolve complex multi-hop medical queries in unstructured clinical PDFs.
- Orchestrated Llama-3 (70B) via Groq for high-precision Named Entity Recognition (NER), mapping complex Drug-Protein-Disease relationships into a structured graph schema.
Llama-3Neo4jLlamaIndexPython
Check Source Code