William Ronchetti

Back-end & infrastructure engineer. I build secure, scalable AWS data platforms that move petabytes.

Alexandria, VA M.Eng. Computer Science, Cornell

01

about

I'm a back-end and infrastructure software engineer with 7+ years building secure, scalable AWS data platforms in Python. My work lives where research data meets production infrastructure — turning organically grown systems into reproducible, audited, compliant platforms.

My strengths are infrastructure-as-code, API and database performance optimization, container-based deployments, and observability — delivering FISMA-Moderate and GDPR-compliant systems that have passed multiple independent security audits.

  • 7+years building
    data platforms
  • PBscale data
    under management
  • 5peer-reviewed
    publications
  • 267citations
    across that work
02

experience

  1. Data Product Engineer

    Sept 2024 — Present

    Novo Nordisk (via Kelly Services Global) · Remote

    • Spearhead development of the Omics Platform, an internal AWS system managing petabytes of diverse omics data for dozens of research teams — both cataloguing and analysis.
    • Built a highly scalable ingestion pipeline on Lambda, Glue, and Batch for validation and transfer of raw data and discovery metadata across several dozen vendors/CROs.
    • Designed a metadata model for tracking diverse omics datasets using Parquet and Amazon Athena.
    • Review code and mentor a small team on software, cloud-engineering, and security best practices to ensure GDPR compliance.
    • AWS Lambda
    • Glue
    • Batch
    • Parquet
    • Athena
    • Python
  2. Senior Software Engineer

    Aug 2019 — Present

    Harvard Medical School — Dept. of Biomedical Informatics · Remote / Boston, MA

    Senior since 2022 (half-time since Jan 2025) — previously Software Engineer

    • Lead back-end development in Python for the CGAP, 4DN, and SMaHT data platforms, supporting petabytes of raw data and associated metadata.
    • Refactored complex, organically grown AWS infrastructure into reproducible infrastructure-as-code (CloudFormation), achieving FISMA-Moderate compliance and passing multiple independent security audits.
    • Migrated the web platform from Apache on Elastic Beanstalk to Nginx on ECS Fargate, cutting compute costs by 50% with configurable autoscaling.
    • Implemented observability with structured event logging via a Splunk HEC pipeline and container log shipping through FireLens / Fluent Bit.
    • Optimized Postgres and Elasticsearch query performance by resolving aggregation timeouts from large terms filters — 50% faster responses.
    • CloudFormation
    • ECS Fargate
    • Nginx
    • PostgreSQL
    • Elasticsearch
    • Splunk
    • FISMA
  3. Associate in Research (part-time)

    Feb 2019 — Present

    Duke University — Dept. of Electrical & Computer Engineering · Durham, NC

    • Develop assignments for an advanced C++ course and improve introductory C programming assignments on Coursera; support students with C and Python coursework.
  4. Teaching Assistant (part-time)

    Aug 2016 — Dec 2018

    Cornell University · Ithaca, NY

    • Taught Operating Systems (CS 4410/4411) and System Security (CS 5430/5431) with practicum sections; built and improved C autograders and graded exams.
03

selected work

Co-author on 5 peer-reviewed publications for data-platform and pipeline contributions, with 267 citations. Selected highlights:

stack

// languages

Python · C/C++ · Java · OCaml · SQL

// infrastructure & IaC

CloudFormation · VPC · EC2 · ECS (Fargate) · ECR · Elastic Beanstalk · Lambda · Batch · Step Functions · API Gateway · Route 53 · Docker · Nginx · Apache

// data & storage

PostgreSQL (RDS) · Elasticsearch · DynamoDB · Amazon Athena · S3 · Parquet

// observability & CI/CD

CloudWatch · Splunk (HEC) · Fluent Bit / FireLens · Sentry · GitHub Actions · Travis CI · ReadTheDocs

// security

AWS KMS · Secrets Manager · Security Hub · ACM · FISMA-Moderate · GDPR

// scientific pipelines

Nextflow · Seqera

04

community & athletics

Coaching

Assistant Cross-Country and Track & Field coach at Arlington Public Schools (Yorktown) — helping the next generation of distance runners train, race, and stick with the sport.

Volunteering

Volunteer with Virginia German Shepherd Rescue and the Lost Dog & Cat Rescue Foundation, supporting fostering and adoption efforts.

Running

Varsity Cross Country and Indoor & Outdoor Track at Cornell. A lifelong endurance athlete — the same patience and consistency I bring to long-running systems work.