AI systems / simulation / data quality

Veli Ates

M.Sc. Embedded Systems Engineering candidate at FH Dortmund. I build practical AI and simulation systems with a focus on dataset auditing, edge deployment, evidence-grounded retrieval, and source-backed biological modeling.

~2M images analyzed with DiversityLens
200+ tests across the Cell simulation
DFKI master's thesis completed
Research Direction

Computational biology, stochastic simulation, computer vision dataset auditing, and embedded AI.

Working Style

Build the pipeline, document the assumptions, test the outputs, and keep the evidence trace visible.

Current Focus

Junior roles in AI evaluation, research engineering, data quality, and applied AI tooling.

Experience

Research and work

A compact view of the work that shaped the portfolio: thesis research, high-volume quality review, and engineering projects that make model behavior inspectable.

Oct 2025 -> Jun 2026

Master's Thesis Researcher, DFKI Robotics Innovation Center

Completed DiversityLens, a Python toolkit for demographic analysis of robot-human interaction and computer vision datasets using OpenCV, RetinaFace, DeepFace, Pandas, Bokeh, pytest, CI, and CLI workflows.

Oct 2022 -> Present

Content Moderator, TELUS Digital

High-volume content quality and policy review work for a major social media platform, with attention to ambiguous edge cases and consistent decision quality.

Oct 2021 -> 2026

M.Sc. Embedded Systems Engineering, FH Dortmund

Coursework and project work in machine learning, computer vision, model-based systems engineering, microelectronics, and hardware/software co-design.

Selected projects

Clean demos, real systems

Each project has a direct demo or repository link. The demos are deployed inside this portfolio so reviewers can inspect the work without hunting through local folders.

Python package / DFKI

DiversityLens

Dataset-auditing toolkit for demographic analysis across large-scale computer vision datasets. The work scaled to roughly two million analyzed images and produced reproducible reports for bias and balance inspection.

Audit footprint
Images
~2M
Reports
Bokeh
Quality
CI
  • OpenCV
  • DeepFace
  • RetinaFace
  • Pandas
  • Bokeh
  • Pytest
Edge AI / C++

Edge AI

A compact neural-net project that trains an MLP in PyTorch with a manual training loop, exports weights and biases, and moves toward C++ int8 inference for edge deployment.

Deployment path
Float32
1.0x
Int8
0.25x
Epochs
1200
  • PyTorch
  • C++
  • Int8
  • Embedded AI
  • Manual Backprop
Simulation / biology

Cell

A source-grounded stochastic whole-cell simulation of a hepatocyte with a Python biochemical engine and TypeScript/Three.js visualizer. The project tracks measured targets and keeps assumptions explicit.

Validation snapshot
Tests
200+
ATP
valid
Redox
valid
  • Python
  • TypeScript
  • Three.js
  • Stochastic Simulation
  • Biology
RAG / FastAPI

AI Evidence Assistant

A semantic document search and grounded chat system. It ingests TXT, PDF, and DOCX files, chunks text with overlap, embeds with sentence-transformers, and retrieves cited passages by cosine similarity.

Retrieval behavior
Top 1
0.92
Top 2
0.86
Top 3
0.71
  • FastAPI
  • Sentence Transformers
  • SQLite
  • Docker
  • Pytest

Toolkit

What I work with

A practical stack for AI systems that need more than a demo: data processing, evaluation, reproducibility, and a clear path from model output to evidence.

Programming

Python, Bash, TypeScript, C++ fundamentals, Git, Linux, LaTeX.

ML and Vision

PyTorch, scikit-learn, OpenCV, DeepFace, RetinaFace, NumPy, Pandas.

Retrieval and APIs

RAG workflows, FAISS, Sentence Transformers, Hugging Face, FastAPI, SQLite.

Scientific Modeling

Gillespie SSA, chemical Langevin, reaction-diffusion, numerical methods, Three.js, pytest.

Contact

Open to AI evaluation, research engineering, and applied AI roles.