Hi, I’m Furkan.
Founding Engineer / AI Product Engineer

I'm a Founding Engineer and AI Product Engineer building agentic AI systems, RAG platforms, backend infrastructure, browser extensions, and production-grade SaaS products.

View Products Contact Me

me@furkancolhak.com

Founder & Builder

About

Applied AI meets product engineering.

Furkan Colhak is a Berlin-based Founding Engineer and AI Product Engineer focused on building AI-native SaaS products from zero to one. His work spans agentic AI, RAG systems, multi-LLM architectures, backend platforms, cloud infrastructure, browser extensions, cybersecurity, document intelligence, automation, and applied machine learning. He has built and shipped products such as PhiShark and easyliterature, owning system design, microservices, APIs, databases, authentication, security, CI/CD, deployment, and product execution.

Products

AI-native SaaS, built from zero to one.

AI-powered phishing and URL risk analysis platform.

PhiShark is an AI-first B2B SaaS/PaaS platform for phishing and URL risk analysis. It mimics a human security analyst using agentic AI workflows and orchestrates 40+ Go REST microservices for URL, domain, DNS, TLS, infrastructure, web content, proxy analysis, enrichment, and risk scoring.

AI-native literature review platform built for researchers.

easyliterature is an AI-native literature review SaaS platform that helps researchers discover papers, parse documents, run RAG-based research chats, extract structured evidence, and generate research reports using multi-agent workflows.

OCR and document intelligence SaaS.

ExtractMyText is an OCR and document intelligence product for extracting text from PDFs and images. It includes asynchronous OCR workflows, provider-based OCR architecture, API access, billing-oriented product flows, and self-hosted deployment patterns. Some OCR/document-processing concepts and services are also reused inside easyliterature.

Experience

Building products, shipping code.

Built and shipped an AI-first phishing and URL risk analysis SaaS/PaaS from zero to one — owning architecture, backend, cloud infra, browser extension, auth, security, and deployment. Designed a Google ADK-based agentic AI orchestrating 40+ Go microservices for multi-layered URL risk analysis and scoring.

Developed scalable infrastructure with Go, Google Cloud Run, BigQuery, Firestore, Kafka, and cost-optimized AI workflows. Implemented public/private APIs, real-time extension telemetry, auth flows, and integration-ready automation for customer-facing workflows.

Owned full productization including API-first access, usage-based pricing, CI/CD, demo environments, pilot deployments, and go-to-market execution.

Built an AI-native literature review SaaS from zero to one with Next.js, Go REST API, PostgreSQL, Redis, MinIO, and microservices. Designed a multi-LLM architecture supporting Gemini, Claude, OpenAI APIs, and local models via Ollama for search, RAG chat, extraction, and report generation.

Implemented end-to-end ingestion and RAG pipelines with LightRAG vector/graph hybrid retrieval, SSE streaming, semantic search, citation-grounded answers, and block-level evidence tracing. Built scholarly API integration, LLM-first extraction, and Google ADK multi-agent report generation.

Owned system design, backend, auth/security, and full product workflows from database to deployment.

Built a full-stack OCR SaaS from zero to one with Next.js, Go, PostgreSQL, MinIO, and Docker Compose. Designed async OCR workflows with file validation, job queue, polling, markdown rendering, credit-based billing, API key management, and webhooks.

Implemented a flexible provider architecture supporting LlamaParse, Mistral OCR API, and self-hosted models via Ollama, with Go worker pools and resilient background processing. Owned system design, CI/CD, and self-hosted production infrastructure.

Developed a production AI chatbot interface for natural-language security platform interaction — scan triggering, asset/vulnerability lookup via LangChain RAG with Qdrant, Supabase, FastAPI, and vector search. Improved alert accuracy by 8% through prompt engineering and model iteration.

Built and productionized ML detectors (Login Page, File Upload) and developed n8n automation workflows for pentest report validation, scanner comparison, and LLM-powered summarization.

Led applied AI/ML research across NLP, computer vision, Transformers, and cybersecurity. Built automated pipelines for large-scale data processing, model training, and evaluation. Created the MTLP-Dataset with 100K+ samples for ML research.

Authored peer-reviewed papers and a book chapter in applied AI, ML, computer vision, and cybersecurity. Translated research into applied prototypes and production-ready ML systems.

Assisted in delivering university courses on Data Intelligence and Statistical Analysis — data preprocessing, visualization, hypothesis testing, statistical analysis, and linear models.

Skills

Full-stack AI product engineering.

Agentic AIAI, LLMs & Agentic SystemsAI-Assisted Software Development WorkflowsAutomation & AI-Assisted DevelopmentAPI DesignBackend & Systems EngineeringAPI KeysSecurity & Product InfrastructureAsync WorkersBackend & Systems EngineeringAuthenticationBackend & Systems EngineeringAuthorizationBackend & Systems Engineering

BashServer Administration & SystemsBigQueryDatabases, Storage & Data PipelinesBrowser ExtensionsFrontend & Product DevelopmentBurp SuiteSecurity & Product Infrastructure

CaddyCloud, DevOps & InfrastructureCaddyServer Administration & SystemsChatGPT-compatible APIsAI, LLMs & Agentic SystemsChromaRAG & Knowledge SystemsCI/CDCloud, DevOps & InfrastructureCitation-Grounded AnswersRAG & Knowledge SystemsClassical ML ModelsMachine Learning & Applied AIClaudeAI, LLMs & Agentic SystemsClaude CodeAutomation & AI-Assisted DevelopmentClient ComponentsFrontend & Product DevelopmentCloud LoggingCloud, DevOps & InfrastructureCodexAutomation & AI-Assisted DevelopmentCron JobsAutomation & AI-Assisted DevelopmentCursorAutomation & AI-Assisted Development

DashboardsFrontend & Product DevelopmentData ValidationDatabases, Storage & Data PipelinesDataset CreationDatabases, Storage & Data PipelinesDockerCloud, DevOps & InfrastructureDocker ComposeCloud, DevOps & InfrastructureDomain/DNS ConfigurationServer Administration & Systems

EmbeddingsRAG & Knowledge SystemsETL PipelinesDatabases, Storage & Data PipelinesEvidence TracingRAG & Knowledge Systems

Publications

Authored & co-authored research.

Privacy-Preserving Smart Surveillance with Cross-Dataset Violence Detection and Decentralized Evidence Governance

H Coşkun, F Çolhak, A Kulakov, V Dimitrova

CyberMACS – International Applied Cybersecurity Conference IACyC 2026·2026

Accelerating IoV Intrusion Detection: Benchmarking GPU-Accelerated vs CPU-Based ML Libraries

F Çolhak, H Coşkun, TNR Cyrille, T Hoxa, Mİ Ecevit, MN Aydın

CIIT The International Conference on Informatics and Information Technologies (CiiT 2025)·2025

Cybersecurity Monitoring in Vital Utilities Infrastructure: Integrating Specialized Open-Source Intelligence Tools

F Çolhak, MI Ecevit, H Dağ, R Creutzburg

Advances in Intelligent Systems, IFSA Publishing·2025

Phishing Website Detection Through Multi-model Analysis of HTML Content

F Çolhak, MI Ecevit, BE Uçar, R Creutzburg, H Dağ

International Conference on Theoretical and Applied Computing (ICTAC), Springer·2024

Transfer Learning for Phishing Detection: Screenshot-Based Website Classification

F Çolhak, Mİ Ecevit, H Dağ

IEEE UBMK 2024·2024

SecureReg: Combining NLP and MLP for Enhanced Detection of Malicious Domain Name Registrations

F Çolhak, MI Ecevit, H Dağ, R Creutzburg

IEEE International Conference on Electrical, Computer and Energy Technologies (ICECET)·2024

Comparing Deep Neural Networks and Machine Learning for Detecting Malicious Domain Name Registrations

F Çolhak, MI Ecevit, H Dağ, R Creutzburg

IEEE COINS 2024·2024

Garbage in, Garbage Out: A Case Study on Defective Product Prediction in Manufacturing

F Çolhak, BE Uçar, İ Saygut, B Düzgün, F Demirkıran, H Dağ

IEEE UBMK 2023·2023

View all on Google Scholar

Get in Touch

Let’s build something.

Open to founding engineer roles, applied AI engineering roles, technical collaborations, AI product consulting, and research-driven product opportunities.

Send an email View GitHub Connect on LinkedIn

Hi, I’m Furkan.
Founding Engineer / AI Product Engineer

Applied AI meets product engineering.

AI-native SaaS, built from zero to one.

Building products, shipping code.

PhiShark

easyliterature

ExtractMyText

S4E (Security for Everyone)

KHAS Research Center on Cybersecurity and Critical Infrastructure Protection

Kadir Has University

Full-stack AI product engineering.

Authored & co-authored research.

Privacy-Preserving Smart Surveillance with Cross-Dataset Violence Detection and Decentralized Evidence Governance

Accelerating IoV Intrusion Detection: Benchmarking GPU-Accelerated vs CPU-Based ML Libraries

Cybersecurity Monitoring in Vital Utilities Infrastructure: Integrating Specialized Open-Source Intelligence Tools

Phishing Website Detection Through Multi-model Analysis of HTML Content

Transfer Learning for Phishing Detection: Screenshot-Based Website Classification

SecureReg: Combining NLP and MLP for Enhanced Detection of Malicious Domain Name Registrations

Comparing Deep Neural Networks and Machine Learning for Detecting Malicious Domain Name Registrations

Garbage in, Garbage Out: A Case Study on Defective Product Prediction in Manufacturing

Let’s build something.

Hi, I’m Furkan.Founding Engineer / AI Product Engineer

Applied AI meets product engineering.

AI-native SaaS, built from zero to one.

Building products, shipping code.

PhiShark

easyliterature

ExtractMyText

S4E (Security for Everyone)

KHAS Research Center on Cybersecurity and Critical Infrastructure Protection

Kadir Has University

Full-stack AI product engineering.

Authored & co-authored research.

Privacy-Preserving Smart Surveillance with Cross-Dataset Violence Detection and Decentralized Evidence Governance

Accelerating IoV Intrusion Detection: Benchmarking GPU-Accelerated vs CPU-Based ML Libraries

Cybersecurity Monitoring in Vital Utilities Infrastructure: Integrating Specialized Open-Source Intelligence Tools

Phishing Website Detection Through Multi-model Analysis of HTML Content

Transfer Learning for Phishing Detection: Screenshot-Based Website Classification

SecureReg: Combining NLP and MLP for Enhanced Detection of Malicious Domain Name Registrations

Comparing Deep Neural Networks and Machine Learning for Detecting Malicious Domain Name Registrations

Garbage in, Garbage Out: A Case Study on Defective Product Prediction in Manufacturing

Let’s build something.

Hi, I’m Furkan.
Founding Engineer / AI Product Engineer