Skip to main content
Top expert badge
Recommended expert
Profile header background

Gwang Jin (Josephus) Kim-Data Scientist / Applied AI, Automation & Data Systems Researcher

Gwang Jin (Josephus) Kim - Data Scientist / Applied AI, Automation & Data Systems Researcher - profile avatar
Profile header overlay
Zürich, Switzerland

Check rate

Experience

Jan 2023 - Present
Zürich, Switzerland

Data Scientist / Applied AI, Automation & Data Systems Researcher

Independent

Position Summary
Data Scientist / Applied AI, Automation & Data Systems Researcher at Independent
Industries
Chemical
Information Technology
Business Areas
Information Technology
Research and Development
  • Built and explored applied GenAI, RAG, GraphRAG, local LLM, agentic AI and document-intelligence prototypes for structured analysis, evidence extraction, semantic search, technical reasoning and decision-useful reporting
  • Developed private local-LLM workflows and AI system patterns focused on privacy, reproducibility, reviewability, low-cost inference and practical user control
  • Built reproducible Python/R workflows for data analysis, automation, API-driven tooling, validation logic, technical documentation and AI-assisted software development
  • Designed workflows around explicit assumptions, traceable inputs, reviewable outputs and failure-mode awareness rather than black-box “looks good” demonstrations
  • Supported RAHN AG in a chemical/regulatory environment with data extraction and processing around WERCS, a regulatory application for chemical product and compliance data
  • Explored complex application/database schemas and wrote nested SQL queries to extract information for mixture calculations, component relationships, regulatory rules and reporting logic
  • Continued hands-on development in Git/GitHub/GitLab/Bitbucket, Docker/Linux deployment patterns, REST/API workflows, error handling, technical writing and fast AI-assisted prototyping
  • Built technical writing and documentation workflows that turn complex systems into clear runbooks, checklists, decision notes and user-facing explanations
Jan 2022 - Dec 2023
Basel, Switzerland

Associate Expert Digital Solutions

Novartis AG (via Actalent / Allegis Group)

Position Summary
Associate Expert Digital Solutions at Novartis AG (via Actalent / Allegis Group)
Industries
Biotechnology
Pharmaceutical
Business Areas
Information Technology
Research and Development
  • Built machine-learning and deep-learning models for bioprocess and spectral data, including CNN-based prediction of analytical readouts from Raman/spectral inputs under realistic small-data constraints
  • Designed validation logic, compared model behaviour, analyzed prediction errors and treated model failures as useful signals for data, preprocessing and experimental limitations
  • Developed Python, Streamlit, Flask and REST-API-based tools for process monitoring, data retrieval, visualization, reporting and operational decision support
  • Created a Python wrapper for an internal REST API, making live process and bioreactor data easier to access, inspect, analyze, reuse and explain
  • Automated data extraction, cleaning, reporting, backup and monitoring workflows using Python, PowerShell, Bash, SQL and structured documentation
  • Worked in a regulated biopharma R&D / antibody-production environment where model outputs, dashboards and data workflows had to be understandable, traceable and useful to technical users
  • Supported database-backed application/data workflows around Lucullus, a bioreactor software system used to collect sensor data and support on-site bioreactor operations
  • Worked closely with wet-lab, dry-lab, IT, automation and support teams to clarify workflow problems, validate data behaviour and convert analytical needs into usable tools
  • Helped colleagues and senior data scientists with R/Python scripting, deep learning, model interpretation, practical validation and translating modeling ideas into workflows others could use
Jan 2016 - Dec 2021
Freiburg im Breisgau, Germany

Postdoctoral Bioinformatician / Data Scientist

University of Freiburg, SFB 992 “Medical Epigenetics”

Position Summary
Postdoctoral Bioinformatician / Data Scientist at University of Freiburg, SFB 992 “Medical Epigenetics”
Industries
Biotechnology
Education
Business Areas
Research and Development
  • Led end-to-end analysis of large biomedical datasets, including single-cell RNA-seq, bulk RNA-seq, ChIP-seq and ATAC-seq: QC, preprocessing, differential analysis, annotation, visualization, interpretation and publication support
  • Built reproducible Linux/HPC pipelines for large datasets using Bash, R/Bioconductor, Python, Git, conda environments, Make-style automation and job schedulers
  • Processed structured and semi-structured data from sample sheets, genome annotations, result tables, metadata, large text/annotation files and tool outputs
  • Developed and evaluated ML-oriented approaches for genomics questions, including feature engineering, dimensionality reduction, statistical validation, failure-mode analysis and careful interpretation
  • Designed analysis strategies where conclusions had to survive QC, biological plausibility checks, alternative explanations and collaborator review
  • Advised professors, postdocs, PhD students, MSc students and medical students on analysis design, validity limits, reproducibility, troubleshooting and defensible interpretation
  • Wrote practical HPC, pipeline and workflow documentation that improved usability beyond my own immediate research group
  • Contributed to peer-reviewed publications, including work featured as a Nature Cell Biology cover
Jan 2008 - Dec 2014
Freiburg im Breisgau, Germany

Molecular Biology Researcher – PhD, Molecular Medicine

University of Freiburg

Position Summary
Molecular Biology Researcher – PhD, Molecular Medicine at University of Freiburg
Industries
Education
Business Areas
Research and Development
  • Conducted publication-grade research in human genetics, developmental biology, disease mechanisms, regulatory genomics, molecular biology and experimental model systems
  • Worked with structured experimental documentation, sequence/annotation resources, experimental datasets, control logic, failure sources and reproducible interpretation
  • Built the scientific foundation I still use in ML work: experimental design, controls, causality, confounding, data quality, failure modes and cautious interpretation
  • Worked on gene regulation, Dicer/SOX9/SOX8-related biology and genetic disease contexts connected to high-impact publications
Jan 2005 - Dec 2008
Freiburg im Breisgau, Germany

Master Student and Assistant Student Worker, Molecular Medicine

University of Freiburg

Position Summary
Master Student and Assistant Student Worker, Molecular Medicine at University of Freiburg
Industries
Biotechnology
Education
Business Areas
Quality Assurance
Research and Development
  • Built early quantitative and experimental discipline through biomathematics / biostatistics, qRT-PCR, genotyping, recombinant virology, immunohistology, primary cell culture and structured data interpretation
  • Worked in genetics, internal medicine and virology research environments with a strong focus on evidence, controls, data quality and reproducibility

Industry Experience

See where this freelancer has spent most of their professional time.

Experienced in Education, Biotechnology, Chemical, Information Technology, and Pharmaceutical.

Education
Biotechnology
Chemical
Information Technology
Pharmaceutical
Profile match chart

Business Area Experience

See which departments and functions this freelancer has contributed to most.

Experienced in Research and Development, Information Technology, and Quality Assurance.

Research and Development
Information Technology
Quality Assurance
Profile match chart

Summary

Applied Machine Learning Engineer and PhD-trained natural scientist with 10+ years of data-intensive modeling, scientific computing, automation and machine-learning experience, including direct Novartis R&D work in a regulated biopharma environment. Strong hands-on background in Python, deep learning, CNNs, model evaluation, data pipelines, experimental design, failure analysis, scientific validation and production-adjacent digital tooling.

Strong fit for Vision-Language Model / multimodal AI projects where models must be evaluated carefully, improved iteratively and connected to real-world use cases rather than treated as impressive demos. My core strength is the combination of ML implementation, data-centric experimentation, benchmark design, robustness thinking, and the scientific habit of asking why a model fails before claiming that it works.

For FRATCH’s conversational driving / parking use case, I would position myself as an applied ML builder who can support VLM-based solutions, datasets, evaluation frameworks, prompting / fine-tuning strategies, failure analysis, automated testing, and collaboration with integration or domain engineers. I have not worked in automotive ADAS directly, but I have repeatedly worked where sensor-like data, complex systems, model reliability and domain constraints must meet.

Skills

  • Applied Ml / Deep Learning: Supervised Learning, Cnns, Classification, Regression, Model Adaptation, Model Evaluation, Failure Analysis, Error Taxonomy, Small-Data Modeling, Robustness Checks And Cautious Interpretation

  • Vision / Multimodal Ai Readiness: Computer-Vision Fundamentals, Image/Signal-Style Data Thinking, Multimodal Ai, Vision-Language Models, Foundation-Model Evaluation, Prompt-Based Adaptation, Fine-Tuning Concepts And Model-Output Validation

  • Evaluation And Experimentation: Benchmark Design, Experiment Tracking Discipline, Ablation-Style Thinking, Performance Metrics, Edge-Case Discovery, Failure-Mode Analysis, Model Comparison And Data-Centric Improvement Loops

  • Data Engineering For Ml: Dataset Construction, Preprocessing, Annotation-Aware Thinking, Data Validation, Feature Engineering, Versioned Inputs, Reproducible Pipelines, Automated Reporting And Quality-Control Gates

  • Python Ml Implementation: Python, Pytorch, Tensorflow/Keras, Scikit-Learn, Xgboost, Pandas/Numpy, Jupyter, Streamlit/Flask, Rest Apis, Git, Linux And Reproducible Environments

  • Llm / Genai Systems: Llms, Rag, Graphrag, Embeddings, Document Intelligence, Prompt Engineering, Context Engineering, Structured Outputs, Hallucination Reduction And Reviewable Ai Workflows

  • Real-World Model Reliability: Translating Ambiguous Domain Requirements Into Measurable Tests, Identifying Data Gaps, Documenting Assumptions, Explaining Uncertainty And Building Tools Others Can Inspect And Reuse

  • Cross-Functional Delivery: Working With Scientists, Engineers, It/Support Teams And Domain Experts; Translating Domain Needs Into Technical Concepts, Experiments, Reports And Usable Tools

  • Machine Learning / Deep Learning: Deep Learning, Cnns, Classification, Regression, Supervised Learning, Model Validation, Error Analysis, Robustness Checks, Explainability, Uncertainty Communication, Small-Data Modeling, Scikit-Learn, Xgboost, Pytorch, Tensorflow/Keras

  • Vision / Multimodal Ai: Vision-Language Models, Multimodal Ai, Computer-Vision Fundamentals, Image/Signal-Style Data, Foundation Models, Prompt-Based Adaptation, Fine-Tuning Concepts, Model-Output Evaluation, Edge-Case Analysis

  • Evaluation / Experimentation: Benchmark Design, Datasets For Model Assessment, Experiment Design, Metrics, Failure Taxonomies, Data-Centric Improvement, Validation Gates, Reproducible Reports, Automated Evaluation Concepts

  • Data Engineering: Dataset Construction, Data Cleaning, Preprocessing, Annotation-Aware Workflows, Metadata Handling, Feature Engineering, Data Quality, Versioned Inputs, Pipelines, Structured Exports, Monitoring-Ready Workflows

  • Programming: Python, R, Sql, Bash, Powershell, Git, Linux, Macos, Windows, Common Lisp, Julia Exposure, Javascript Exposure

  • Python Ml/Data Stack: Pandas, Numpy, Scipy, Matplotlib, Plotly, Jupyter, Scikit-Learn, Xgboost, Pytorch, Tensorflow/Keras, Streamlit, Flask, Rest Apis

  • Llm / Genai Systems: Llms, Genai, Rag, Graphrag, Embeddings, Semantic Search, Prompt Engineering, Context Engineering, Local Llms, Document Intelligence, Structured Outputs, Hallucination Reduction, Ai Evaluation

  • Databases / Structured Data: Sql, Relational Databases, Postgresql/Mysql/Mariadb, Sql Server Exposure, Schema Exploration, Joins, Nested Sql, Data Extraction, Data-Quality Checks And Reporting Workflows

  • Deployment-Adjacent Tooling: Docker Exposure, Linux Deployment Patterns, Reproducible Environments, Rest/Api Integration, Logging-Oriented Thinking, Monitoring Logic, Technical Documentation, Ci/Cd Exposure

Languages

German
Native
Korean
Native
English
Advanced

Education

Oct 2008 - Jun 2014

University of Freiburg

PhD, Molecular Medicine · Molecular Medicine · Freiburg im Breisgau, Germany

Oct 2005 - Jun 2008

University of Freiburg

MSc / Diploma, Molecular Medicine · Molecular Medicine · Freiburg im Breisgau, Germany

Certifications & licenses

Agile Software Development: Scrum For Developers

LinkedIn Learning

Data Analysis With Python

freeCodeCamp

Javascript Algorithms And Data Structures

freeCodeCamp

Neo4j Graph Data Science Certification

Neo4j

Scientific Computing With Python

freeCodeCamp

Statistics

Experience

Total positions 5
Experience in Education 16 y
Avg length 4 y 5 m
Longest experience 6 y 11 m

Global Experience

Countries worked in 2 (Germany, Switzerland)
Primary country Germany

Expertise

Recent roles Data Scientist / Applied AI, Automation & Data Systems Researcher, Associate Expert Digital Solutions, Postdoctoral Bioinformatician / Data Scientist
Main industries Education, Biotechnology, Chemical
Main business areas Research and Development, Information Technology, Quality Assurance

Qualifications

Highest degree Doctorate
Certifications earned 5

Profile

Created

Frequently asked questions

Have questions? Find more information here.

Gwang Jin is based in Zürich, Switzerland and can operate in on-site, hybrid, and remote work models.
Gwang Jin speaks the following languages: German (Native), Korean (Native), English (Advanced).
Gwang Jin has at least 20 years of experience. During this time, Gwang Jin has worked in at least 5 different roles and for 4 different companies. The average length of individual experience is 4 years and 1 month. Note that Gwang Jin may not have shared all experience and actually has more experience.
Based on recent experience, Gwang Jin would be well-suited for roles such as: Data Scientist / Applied AI, Automation & Data Systems Researcher, Associate Expert Digital Solutions, Postdoctoral Bioinformatician / Data Scientist.
Gwang Jin's most recent position is Data Scientist / Applied AI, Automation & Data Systems Researcher at Independent.
In recent years, Gwang Jin has worked for Independent, Novartis AG (via Actalent / Allegis Group), University of Freiburg, and SFB 992 “Medical Epigenetics”.
Gwang Jin is most experienced in industries like Education, Biotechnology, and Chemical. Gwang Jin also has some experience in Information Technology and Pharmaceutical.
Gwang Jin is most experienced in business areas like Research and Development, Information Technology, and Quality Assurance.
Gwang Jin has recently worked in industries like Biotechnology, Education, and Chemical.
Gwang Jin has recently worked in business areas like Research and Development and Information Technology.
Gwang Jin holds a Doctorate in Molecular Medicine from University of Freiburg and a Master in Molecular Medicine from University of Freiburg.
Gwang Jin has 5 certificates. Among them, these include: Agile Software Development: Scrum For Developers, Data Analysis With Python, and Javascript Algorithms And Data Structures.
Gwang Jin is immediately available full-time for suitable projects.
Gwang Jin's rate depends on the specific project requirements. Please use the Meet button on the profile to schedule a meeting and discuss the details.
To hire Gwang Jin, click the Meet button on the profile to request a meeting and discuss your project needs.

Average rates for similar positions

Rates are based on recent contracts and do not include FRATCH margin.

1000
750
500
250
Rate comparison chart
Market avg: 760-920 €
The rates shown represent the typical market range for freelancers in this position based on recent contracts on our platform.
Actual rates may vary depending on seniority level, experience, skill specialization, project complexity, and engagement length.