Chris Crawford

Data Engineer

crawforc3@pm.me


Experience

Data Engineer

John Deere (Agriculture/Manufacturing) | Apr 2021 – Present

  • Led migration of ML model serving infrastructure from AWS SageMaker to Databricks, designing architectural diagrams, migration flow charts, and technical documentation for cross-platform deployment
  • Built and maintained MLOps infrastructure using Terraform and GitHub Actions, including IAM roles/policies, Databricks model serving endpoints, and Unity Catalog permissions for production ML workloads
  • Implemented real-time alerting and monitoring systems for ML inference pipelines, including Databricks alerts with Microsoft Teams notifications and automated non-200 status monitoring
  • Developed reusable GitHub Actions for secure AWS Secrets Manager integration, enabling centralized credential management across data engineering repositories with OIDC authentication
  • Established CI/CD pipelines and infrastructure-as-code practices for Databricks deployments, including schema management, inference table grants, and service principal permissions
  • Contributed technical documentation and architectural designs for the precision agriculture platform, supporting team knowledge sharing and onboarding for ML infrastructure initiatives

Awards:

  • 2025 – Quarterly Award: Developed machine learning infrastructure to classify machine state at 250k+ requests per min.
  • 2023 – Quarterly Award: Developed machine learning API that enabled legacy machines to benefit from data science initiatives.
  • 2022 – Quarterly Award: Developed data pipeline to facilitate safely training ML models with data protected by global privacy laws.

Data Engineer

Team Rubicon (Disaster Response Non-profit) | Mar 2019 – Feb 2021

  • Developed analytical pipeline to quantify financial impact of volunteers over 300+ disaster operations; secured $200k donation.
  • Leveraged GIS to map and prioritize disaster relief needs by combining CDC Social Vulnerability Index with operational data.
  • Designed and developed the entire data analytics platform for the organization with Azure Datalake, Databricks, and Function apps.
  • Migrated data analytics platform from Palantir Foundry to Databricks; enabled organization-wide access to more data analytics tools.
  • Mentored two junior analysts on Power BI and SQL; doubled data engineering capacity and reduced requests for ad-hoc analyses.

Awards:

Data Analyst, Developer Advocate (contract)

Kaggle (Data Science Competitions) | Jun 2017 – Mar 2019

  • Partnered with the NFL to develop NFL First and Future, a crowd-sourced competition to improve player safety through data science.
  • Created a new competition format with non-profit partners, driving an 8% increase in competition launches in 2018.

Software Engineering Intern

Smashrun (Fitness Analytics) | Sept 2016 – Dec 2016

  • Developed a data pipeline for processing geojson data into map tiles and generating geospatial visualizations of user running activity.

Bioinformatics Engineering Intern

Adaptive Biotechnologies (Biotech) | May 2016 – Sept 2016

  • Developed bioinformatics tool with Python and R to automate chi-square testing of DNA sequencing data for sample independence.
  • Automated statistical reporting and generating scientific publication-ready data visualizations using Pandas, Matplotlib, Seaborn.

Aircraft Electrical and Environmental Specialist

United States Air Force | Dec 2005 – Dec 2009

  • Trained airmen on aircraft maintenance and safety, achieving 100% success in quality assurance and zero injuries over two years.
  • Maintained electrical and avionics systems on C-130 and MQ-9 aircraft, ensuring mission readiness for fleets valued at over $500M.
  • Participated in Exercise Balikatan, the annual U.S.-Philippines military exercise focused on counterterrorism operations, humanitarian assistance, and disaster relief, maintaining aircraft readiness for combined training operations.
  • Supported Operation Christmas Drop, the world’s longest-running humanitarian airdrop mission, maintaining aircraft systems for Pacific Air Forces training operations delivering essential supplies to remote Pacific island communities.
  • Collaborated with Japanese Air Self-Defense Forces on joint maintenance operations, facilitating knowledge exchange and strengthening international partnerships.

Education

Master of Science, Bioinformatics – Northeastern University, Dec 2016

Bachelor of Science, Biology – Pacific Lutheran University, Dec 2013

Associate of Applied Science, Aircraft Maintenance Technology – Community College of the Air Force, May 2008


Interests

  • Autonomous Rover: I am building an autonomous rover to deliver homebrew to my neighbors (blog post)
  • Homebrewing beer: I have been homebrewing beer for over 15 years and I even brewed beer professionally for a couple of years.
  • 3D printing: I use OnShape to create 3D models for the various projects I am working on. I have designed, modeled, and printed tools for homebrewing, biodegradable seed starters, and parts for my autonomous rover (Printables)
  • Self-hosting and Homelab: I maintain a 3-node cluster of servers in my basement where I use Proxmox for virtualization and high availability. I host my own servers for Gitea (GitHub alternative), games, VPNs, and various other services.
  • Kaggle: I am a Kaggle Grandmaster and was the world’s first Kaggle Datasets Grandmaster (profile)

Skills

Cloud & Infrastructure: AWS, Terraform, Docker, Lambda, S3, API Gateway, ECS, ECR, AWS Secrets Manager, Active Directory

Data & Databases: SQL, PostgreSQL, MySQL, Redis, DynamoDB, Data Lake, Delta Lake, Spark, Scala

ML & Analytics: Databricks, SageMaker, MLFlow, Model Serving, Machine Learning, Python, Pandas

GIS & Geospatial: Esri, ArcGIS, GIS, Shapely

DevOps & Testing: GitHub Actions, Jenkins, Drone, CI/CD, Unit Testing, Integration Testing, Load Testing, Locust

APIs & Monitoring: REST API, OpenAPI, Swagger, Datadog, Dashboards