Emad Kalantari

MSc Information & Computer Sciences · AI · Machine Learning · Data Engineering · Data Science · Luxembourg

Work Experience
Jul 2024 – Feb 2026
Applied AI R&D — LLM Pipelines & NLP
Luxembourg Centre for Contemporary and Digital History, University of Luxembourg
  • Designed and implemented a modular, end-to-end NLP pipeline for entity and relationship extraction from long-form historical documents, integrating text cleaning, overlapping chunking (LangChain), LLM-based coreference resolution, and structured JSON output via OpenAI APIs (GPT-4o, GPT-4.1, GPT-5, o3).
  • Introduced a dedicated coreference resolution stage to disambiguate pronouns and collective entity references prior to extraction, independently increasing attribution accuracy by ~20%; benchmarked 43 experimental configurations varying chunking, context windows, prompting strategies, and model selection across a curated ground-truth dataset of 176 annotated relationships.
  • Achieved a 250% F1-score improvement over the naïve baseline through iterative prompt engineering and pipeline refinement; validated reproducibility via 10-run stability testing, identifying OpenAI o3 as the optimal model for reasoning-intensive extraction tasks.
  • Research formed the basis of an interdisciplinary Master's thesis at the intersection of NLP and Digital History; findings are being prepared for publication in a peer-reviewed journal in collaboration with supervisors at C²DH.
Mar 2024 – Jun 2024
Project Assistant — Digital Platforms & UX
Department of Computer Science, University of Luxembourg
  • Conducted structured usability and content reviews of departmental digital platforms, identifying broken links, layout inconsistencies, and content errors to improve overall platform quality.
  • Documented and reported reproducible issues to development and administrative teams, supporting iterative improvements in platform stability and user experience.
  • Designed and deployed Google Forms to collect structured student input for events and departmental initiatives, supporting organized data collection and stakeholder coordination.
Mar 2022 – Mar 2023
Data Analyst Intern
Research & Development Center, ATEC Consultants Co.
  • Collected and processed experimental urban design datasets; performed data cleaning, transformation, and exploratory analysis using Python (Pandas, NumPy) to support architecture and planning workflows.
  • Developed automated Python scripts to preprocess urban design datasets, eliminating repetitive manual preparation steps and improving data pipeline efficiency by 10–15%.
  • Built data visualizations (Matplotlib, Seaborn) to communicate analytical findings to architects and project managers during internal design reviews.
  • Contributed to technical documentation and presented analysis results to cross-functional teams spanning data, architecture, and project management.
Education
Sep 2023 – Feb 2026
Master's in Information and Computer Sciences
University of Luxembourg
Sep 2016 – Sep 2021
Bachelor's in Computer Engineering
Azad University of Tehran Central Branch
Technical Skills
Languages
Python R SQL
ML & DL Frameworks:
PyTorch scikit-learn XGBoost Torchvision Ultralytics Albumentations
LLM & NLP
Hugging Face Transformers LangChain OpenAI API Anthropic API Prompt Engineering RAG ChromaDB
Data Processing
Pandas NumPy Scikit-learn XGBoost SHAP imbalanced-learn
Data Engineering
PySpark Apache Airflow dbt PostgreSQL SQLite AWS S3 boto3 Parquet ETL/ELT Pipelines Medallion Architecture
Data Ingestion & APIs
REST API integration OpenAQ API JSON processing
MLOps & DevOps
Docker GitHub Actions (CI/CD) FastAPI pytest Git GitHub
Visualisation & BI
Matplotlib Seaborn ggplot2 Plotly Tableau Power BI Streamlit
Cloud & Compute
AWS S3 University of Luxembourg HPC (Tesla V100) SLURM CUDA
Interpersonal Skills
Collaboration Analytical Thinking Problem-solving Fast Learner Attention to detail