Hi, I’m Furkan.
Founding Engineer / AI Product Engineer

I'm a Founding Engineer and AI Product Engineer building agentic AI systems, RAG platforms, backend infrastructure, browser extensions, and production-grade SaaS products.

Furkan Colhak
Founder & Builder
PhiShark
easyliterature
ExtractMyText

About

Applied AI meets product engineering.

Furkan Colhak is a Berlin-based Founding Engineer and AI Product Engineer focused on building AI-native SaaS products from zero to one. His work spans agentic AI, RAG systems, multi-LLM architectures, backend platforms, cloud infrastructure, browser extensions, cybersecurity, document intelligence, automation, and applied machine learning. He has built and shipped products such as PhiShark and easyliterature, owning system design, microservices, APIs, databases, authentication, security, CI/CD, deployment, and product execution.

Experience

Building products, shipping code.

Built and shipped an AI-first phishing and URL risk analysis SaaS/PaaS from zero to one — owning architecture, backend, cloud infra, browser extension, auth, security, and deployment. Designed a Google ADK-based agentic AI orchestrating 40+ Go microservices for multi-layered URL risk analysis and scoring.

Developed scalable infrastructure with Go, Google Cloud Run, BigQuery, Firestore, Kafka, and cost-optimized AI workflows. Implemented public/private APIs, real-time extension telemetry, auth flows, and integration-ready automation for customer-facing workflows.

Owned full productization including API-first access, usage-based pricing, CI/CD, demo environments, pilot deployments, and go-to-market execution.

Built an AI-native literature review SaaS from zero to one with Next.js, Go REST API, PostgreSQL, Redis, MinIO, and microservices. Designed a multi-LLM architecture supporting Gemini, Claude, OpenAI APIs, and local models via Ollama for search, RAG chat, extraction, and report generation.

Implemented end-to-end ingestion and RAG pipelines with LightRAG vector/graph hybrid retrieval, SSE streaming, semantic search, citation-grounded answers, and block-level evidence tracing. Built scholarly API integration, LLM-first extraction, and Google ADK multi-agent report generation.

Owned system design, backend, auth/security, and full product workflows from database to deployment.

Built a full-stack OCR SaaS from zero to one with Next.js, Go, PostgreSQL, MinIO, and Docker Compose. Designed async OCR workflows with file validation, job queue, polling, markdown rendering, credit-based billing, API key management, and webhooks.

Implemented a flexible provider architecture supporting LlamaParse, Mistral OCR API, and self-hosted models via Ollama, with Go worker pools and resilient background processing. Owned system design, CI/CD, and self-hosted production infrastructure.

Developed a production AI chatbot interface for natural-language security platform interaction — scan triggering, asset/vulnerability lookup via LangChain RAG with Qdrant, Supabase, FastAPI, and vector search. Improved alert accuracy by 8% through prompt engineering and model iteration.

Built and productionized ML detectors (Login Page, File Upload) and developed n8n automation workflows for pentest report validation, scanner comparison, and LLM-powered summarization.

Led applied AI/ML research across NLP, computer vision, Transformers, and cybersecurity. Built automated pipelines for large-scale data processing, model training, and evaluation. Created the MTLP-Dataset with 100K+ samples for ML research.

Authored peer-reviewed papers and a book chapter in applied AI, ML, computer vision, and cybersecurity. Translated research into applied prototypes and production-ready ML systems.

Assisted in delivering university courses on Data Intelligence and Statistical Analysis — data preprocessing, visualization, hypothesis testing, statistical analysis, and linear models.

Skills

Full-stack AI product engineering.

A
Agentic AIAI, LLMs & Agentic SystemsAI-Assisted Software Development WorkflowsAutomation & AI-Assisted DevelopmentAPI DesignBackend & Systems EngineeringAPI KeysSecurity & Product InfrastructureAsync WorkersBackend & Systems EngineeringAuthenticationBackend & Systems EngineeringAuthorizationBackend & Systems Engineering
B
BashServer Administration & SystemsBigQueryDatabases, Storage & Data PipelinesBrowser ExtensionsFrontend & Product DevelopmentBurp SuiteSecurity & Product Infrastructure
C
CaddyCloud, DevOps & InfrastructureCaddyServer Administration & SystemsChatGPT-compatible APIsAI, LLMs & Agentic SystemsChromaRAG & Knowledge SystemsCI/CDCloud, DevOps & InfrastructureCitation-Grounded AnswersRAG & Knowledge SystemsClassical ML ModelsMachine Learning & Applied AIClaudeAI, LLMs & Agentic SystemsClaude CodeAutomation & AI-Assisted DevelopmentClient ComponentsFrontend & Product DevelopmentCloud LoggingCloud, DevOps & InfrastructureCodexAutomation & AI-Assisted DevelopmentCron JobsAutomation & AI-Assisted DevelopmentCursorAutomation & AI-Assisted Development
D
DashboardsFrontend & Product DevelopmentData ValidationDatabases, Storage & Data PipelinesDataset CreationDatabases, Storage & Data PipelinesDockerCloud, DevOps & InfrastructureDocker ComposeCloud, DevOps & InfrastructureDomain/DNS ConfigurationServer Administration & Systems
E
EmbeddingsRAG & Knowledge SystemsETL PipelinesDatabases, Storage & Data PipelinesEvidence TracingRAG & Knowledge Systems

Publications

Authored & co-authored research.

01

Privacy-Preserving Smart Surveillance with Cross-Dataset Violence Detection and Decentralized Evidence Governance

H Coşkun, F Çolhak, A Kulakov, V Dimitrova

CyberMACS – International Applied Cybersecurity Conference IACyC 2026·2026
02

Accelerating IoV Intrusion Detection: Benchmarking GPU-Accelerated vs CPU-Based ML Libraries

F Çolhak, H Coşkun, TNR Cyrille, T Hoxa, Mİ Ecevit, MN Aydın

CIIT The International Conference on Informatics and Information Technologies (CiiT 2025)·2025
03

Cybersecurity Monitoring in Vital Utilities Infrastructure: Integrating Specialized Open-Source Intelligence Tools

F Çolhak, MI Ecevit, H Dağ, R Creutzburg

Advances in Intelligent Systems, IFSA Publishing·2025
04

Phishing Website Detection Through Multi-model Analysis of HTML Content

F Çolhak, MI Ecevit, BE Uçar, R Creutzburg, H Dağ

International Conference on Theoretical and Applied Computing (ICTAC), Springer·2024
05

Transfer Learning for Phishing Detection: Screenshot-Based Website Classification

F Çolhak, Mİ Ecevit, H Dağ

IEEE UBMK 2024·2024
06

SecureReg: Combining NLP and MLP for Enhanced Detection of Malicious Domain Name Registrations

F Çolhak, MI Ecevit, H Dağ, R Creutzburg

IEEE International Conference on Electrical, Computer and Energy Technologies (ICECET)·2024
07

Comparing Deep Neural Networks and Machine Learning for Detecting Malicious Domain Name Registrations

F Çolhak, MI Ecevit, H Dağ, R Creutzburg

IEEE COINS 2024·2024
08

Garbage in, Garbage Out: A Case Study on Defective Product Prediction in Manufacturing

F Çolhak, BE Uçar, İ Saygut, B Düzgün, F Demirkıran, H Dağ

IEEE UBMK 2023·2023

Get in Touch

Let’s build something.

Open to founding engineer roles, applied AI engineering roles, technical collaborations, AI product consulting, and research-driven product opportunities.