Jason Thomas, PhD

Healthcare Data Operations & AI/MLOps Platform Leader | Multimodal Medical Data & Partnerships

Philips Image Guided Therapy Devices

Biography

I lead healthcare data operations, regulated AI/MLOps platform strategy, and data governance for medical-device R&D. My work sits between engineering, clinical informatics, product AI, regulatory evidence, privacy, and business strategy: translating product roadmaps into data demand/supply models, acquisition and annotation priorities, fit-for-purpose criteria, platform roadmaps, and reusable assets for development, validation, analytics, and real-world evidence.

At Philips Image Guided Therapy Devices, I built and now lead a distributed Data & AI Platform Engineering function spanning data sourcing, curation, annotation, data/ML engineering, infrastructure, MLOps, stewardship, governance, and real-world data. I manage a multi-million-dollar platform budget, support more than 100 developers and analysts, and help shape healthcare data and AI roadmaps, investment logic, success metrics, and executive dashboards across senior R&D and business-unit leadership.

My background also spans biomedical informatics research, real-world evidence, synthetic-data validation, medical NLP, ECG/VCG/ECGi, speech/audio biomarkers, clinical imaging context, and intraprocedural multimodal data. I am especially interested in the data, validation, and platform foundations required for physical AI in healthcare, including AI-assisted procedures, medical robotics, simulation, and sensor-fusion workflows.

Download Jason’s resume.

Interests
  • Healthcare data operations and strategy
  • Regulated AI/MLOps platforms
  • Multimodal medical and sensor data
  • Data partnerships, licensing, and governance
  • Real-world evidence and clinical informatics
  • Privacy-preserving data utility and validation
  • Medical data standards and interoperability
  • High-performance technical team building
Education
  • PhD Biomedical Informatics; Data Science Specialization

    University of Washington - Seattle WA

  • Bachelor of Science - Human Physiology | Biology (Chem minor)

    University of Oregon - Eugene OR

Skills

Healthcare Data Strategy

Roadmap-driven data demand/supply modeling, fit-for-purpose criteria, acquisition and annotation portfolios, investment metrics, and release planning.

Regulated AI/MLOps

Governed data platforms for AI development, validation, analytics, and real-world evidence.

Multimodal Medical Data

Imaging context, EHR/claims, ECG/VCG/ECGi, audit logs, speech/audio biomarkers, and intraprocedural data.

Partnerships & Governance

Data-use agreements, licensing decisions, active learning and annotation operations, de-identification, provenance, and access controls.

Lakehouse & Data Engineering

Data lakes/lakehouses, data contracts, lineage, dataset versioning, quality gates, and reproducible pipelines.

Medical AI Governance

HIPAA, GDPR, IRB workflows, EU MDR, EU AI Act readiness, QMS-aligned controls, and regulated evidence.

Clinical Informatics & RWE

Computable phenotypes, clinical features, post-market surveillance, real-world evidence, and PMCF analytics.

Standards & Vocabularies

OMOP, HL7 FHIR, SNOMED, LOINC, ICD-10, CPT, HCPCS, RxNorm, UMLS, and medical data harmonization.

Cloud, Data & Compute

Python, SQL, AWS, Airflow, PySpark, Parquet, Redshift, Athena, SageMaker, ClearML, GitHub Actions, and Linux.

Team & Platform Leadership

Hiring, mentoring, platform operating models, technical roadmaps, budget ownership, and cross-functional leadership.

Experience

Head of Data & AI Platform Engineering
Oct 2024 – Present Bothell, WA, USA
  • Own healthcare data operations and AI/MLOps platform strategy for a global medtech R&D organization, translating product roadmaps into data demand/supply models, acquisition and annotation priorities, fit-for-purpose criteria, governance controls, platform roadmaps, investment metrics, and executive dashboards.
  • Built from zero to eight people and now lead a distributed platform function across data sourcing, curation, annotation, data/ML engineering, infrastructure, MLOps, stewardship, governance, and clinical informatics.
  • Manage a multi-million-dollar platform budget and support 100+ developers, analysts, and product AI/algorithm teams through governed self-service data access.
  • Set portfolio priorities for healthcare data partnerships, licensed data, labeled and unlabeled data needs, vendor/tooling choices, annotation operations, lakehouse architecture, and AI-ready dataset reuse.
  • Architect platform patterns for DICOM-linked clinical/imaging data, real-world evidence, and intraprocedural multimodal medical/sensor data used in AI-assisted workflows.
  • Establish regulated AI validation foundations including lineage, reproducibility, dataset/model versioning practices, quality gates, train/validation/test governance, and blinded held-out evaluation sets.
 
Principal Data & AI Scientist / Tech Lead - Senior Data & AI Scientist
Feb 2022 – Sep 2024 Bothell, WA, USA
  • Designed and scaled real-world-evidence analytics across more than 1,000 hospitals and 1B+ patient visits for post-market surveillance, regulatory evidence, clinical strategy, and commercial decision support.
  • Standardized a reusable library of 150+ computable phenotypes and clinical features used across regulated submissions, analytics, and downstream AI workflows.
  • Reduced dashboard build time from roughly two days to roughly ten minutes through API automation and reusable evidence-generation patterns.
  • Delivered production AI/GenAI systems on AWS, including identity-resolution and graph/vector retrieval workflows with evaluation harnesses for precision, recall, F1, and regression testing.
  • Helped author data and AI strategy narratives, governance practices, and roadmap proposals adopted by senior leadership.
 
National Library of Medicine Biomedical Informatics & Data Science Pre-Doctoral Fellow
Sep 2017 – Sep 2021 Seattle, WA, USA
  • Built OMOP-based data warehouse and pipeline assets; assessed whether real and synthetic EHR/audit-log data were fit for secondary use under privacy-preserving constraints.
  • Contributed to National COVID Cohort Collaborative synthetic-data validation work and analyzed more than 1.8M SARS-CoV-2 tests for geospatial and temporal epidemiologic utility.
  • Developed multimodal ML models combining speech/audio, language, and clinical history to predict dementia status and evaluate clinical text/voice biomarkers.
  • Trained GPU-accelerated ML models on Slurm-managed HPC infrastructure and AWS.
  • Released reusable public healthcare data artifacts, including medical NLP code/data for replication and a 2,882-question annotated medical question-type dataset.
 
Senior Research Assistant
Apr 2015 – Sep 2017 Portland, OR, USA
  • Developed predictive models and annotation workflows across 100k+ ECGs; sourced clinical data under HIPAA and mined EHR outcomes for publication-ready analyses.
  • Collected and analyzed multimodal electrophysiology and imaging-linked data including ECGs, 128-electrode body-surface ECG/ECGi recordings, device interrogations, and intracardiac EGMs during cath-lab procedures.
  • Co-authored work combining CT/MRI/3D geometry and electrode localization for cardiac activation mapping.
  • Recruited 350+ participants across observational and interventional studies, wrote study protocols, and supported multiple IRB approvals.
 
Executive Director
Glow XC 501(c)(3)
Oct 2013 – Jul 2016 Eugene, OR, USA
  • Cofounded the organization in 2013 and served as Executive Director from 2014; organized a 300-person race raising funds for rural health EMS.
  • Held full responsibility for P&L, logistics, legal compliance, and a team of 5-10 people.
  • Gave live radio interviews.
 
Clinic Associate & Electrocardiogram Technician
Jun 2013 – Mar 2015 Portland, OR, USA
  • Worked at more than 15 clinics performing electrocardiograms, blood draws, and rapid tests; handled in-person scheduling and billing; trained more than 10 new employees; and contributed process improvements.
 
Volunteer Research Assistant
Feb 2012 – Jun 2013 Eugene, OR, USA
  • Conducted and recorded VO2 max exercise tests and altitude chamber studies with human subjects; processed lab specimens, recruited and scheduled subjects, and analyzed data.
 
Facility Manager
Sep 2010 – Jun 2013 Eugene, OR, USA
  • Managed ~10 direct reports per shift in a 250,000 sq ft facility; served as a first responder responsible for the safety of all students and staff; developed a new hiring process to screen 700 applicants.
 
Clinic Support Staff
Oct 2011 – Jan 2012 Eugene, OR, USA
  • Aided healthcare delivery to a mainly low-income and special needs population
  • Assembled outgoing prescriptions for individual patients and kept track of inventory
  • Assisted in clinic-wide conversion of paper medical records to digital files

Accomplishments

Innovator of the Year Finalist (highest individual R&D award)
Finalist after just 1.5 years of working at Philips
Program of the Year Finalist (team award)
Best Business Impact - Natural Language Processing category (short paper)
A short (2-page) paper I wrote as first author was selected. The work is confidential, but in general the paper described our novel use of large language models (generative AI) to solve business problems.
Editor’s Choice - Research and applications
A manuscript I wrote as first author was selected; it was featured by JAMIA and made open access by the journal.
Appointed to the AMIA Annual Symposium Scientific Program Committee
Appointed to 2021-2023 JAMIA Editorial Board
Appointed to 2019-2021 JAMIA Student Editorial Board
Awarded Biomedical Informatics & Data Science Pre-Doctoral Fellowship T15 Grant
Full tuition waiver and stipend, ~40 new slots/year nationally
Top Scholar Top off Award
One-time additional funding awarded to the top two recruits per year in the BIME department
Young Investigator Finalist
Presented initial findings from my original research, ‘Global Electrical Heterogeneity in Young Athletes.’ For this work I designed the study, traveled alone to the US Alpine National Championships to collect data, and published the resulting paper as equal first author.

Recent Publications

Adaptive Cardiac Resynchronization Therapy Effect on Electrical Dyssynchrony (aCRT-ELSYNC): A randomized controlled trial
Background: Adaptive cardiac resynchronization therapy (aCRT) is known to have clinical benefits over conventional CRT, but the …
The National COVID Cohort Collaborative (N3C): Rationale, Design, Infrastructure, and Deployment
Objective: COVID-19 poses societal challenges that require expeditious data and knowledge sharing. Though organizational clinical data …