About | Bijaya Kumar Pariyar

My Story

I'm a self-taught Data Scientist with deep hands-on experience building end-to-end machine learning solutions across classification, regression, clustering, anomaly detection, and time series forecasting.

What started as curiosity about data turned into a full-stack journey through the ML ecosystem. I've gone from training my first Random Forest to building production-grade RAG systems integrating LLMs like Llama-3.3-70B via Groq — all without a formal CS degree, driven entirely by self-study and project-based learning.

Currently working as a Data Scientist Intern at CR Equity AI, Inc. (remote, Tallahassee, FL, USA), where I design and implement production-grade RAG pipelines, integrate LLM APIs, and build FastAPI services for document intelligence. My work spans classical ML to cutting-edge AI systems with a strong focus on security, scalability, and real-world deployment.

I've completed 20+ structured ML projects handling large-scale datasets — 1M+ rows for fraud detection and 1.7M+ rows for time series forecasting. My approach combines rigorous evaluation, model explainability through SHAP values, and reproducible research workflows.

What I Stand For

Rigorous

Proper evaluation, CV splits, no data leakage — results you can trust.

Explainable

SHAP values and feature importance — black-box outputs become actionable insights.

Production-Ready

Docker, FastAPI, JWT auth, rate limiting — models that actually ship.

Quick Info

Education

BCA, Tribhuvan University

Location

Kathmandu, Nepal

Current Role

Data Scientist Intern @ CR Equity AI

Work Mode

Remote (Tallahassee, FL, USA)

Languages

Nepali (Native), English (Professional)

Interests

AI Research, RAG Systems, Open Source

By The Numbers

ML Projects20+

Rows Processed1M+

GitHub Repos50+

Research Papers1

Tech Stack Size25+

Connect With Me

GitHub LinkedIn Kaggle bijaybeezoe@gmail.com

Career Journey

From self-study to production AI systems

Dec 2025 – Present

Data Scientist Intern

Current

CR Equity AI, Inc. · Remote (Tallahassee, FL, USA)

Building production-grade RAG pipelines, LLM integrations, and FastAPI services for document intelligence. Collaborating with a US-based engineering team using Agile practices.

RAG FastAPI LLMs FAISS Qdrant Docker

2024 – 2025

Entering the AI & RAG Ecosystem

Self-directed Research & Projects

Transitioned from classical ML to generative AI. Built RAG systems with FAISS and Qdrant, explored vector embeddings, cross-encoder reranking, and LLM API integration. Developed a production-grade Personal Knowledge Base RAG API.

RAG Vector DBs Embeddings LLMs FastAPI

2024

Research Preprint Published

Independent Research

Published preprint on SHAP-Based Feature Selection and Iterative Hyperparameter Tuning for Customer Churn Prediction in Telecommunication Datasets — demonstrating interpretability and optimization in production ML models.

SHAP Hyperparameter Tuning Research

2023 – 2024

Deep Dive into ML

Self-directed Learning

Built 20+ end-to-end ML projects spanning classification, regression, time series, clustering, and anomaly detection. Handled 1M+ row datasets, implemented SHAP explainability, and developed web applications around ML models including ChurnShield.

Scikit-learn XGBoost LightGBM Flask SHAP

2020 – Present

Bachelor of Computer Applications (BCA)

Tribhuvan University · Kathmandu, Nepal

Formal education in computer science fundamentals — algorithms, databases, programming, and software engineering. Applied formal knowledge through independent data science and AI projects running in parallel with academic studies.

Research

Preprint · 2024

SHAP-Based Feature Selection and Iterative Hyperparameter Tuning for Customer Churn Prediction in Telecommunication Datasets

Explores interpretability and optimization techniques for churn modeling, demonstrating SHAP values for transparent feature importance and iterative tuning for enhanced model performance on real-world telecom data. Bridges the gap between black-box ML models and actionable business insights.

SHAP Hyperparameter Tuning Churn Prediction Telecom Explainability Random Forest

Bijaya Kumar Pariyar