Chris Crawford
Data Engineer
Experience
Data Engineer
John Deere (Agriculture/Manufacturing) | Apr 2021 – Present
- Led migration of ML model serving infrastructure from AWS SageMaker to Databricks, designing architectural diagrams, migration flow charts, and technical documentation for cross-platform deployment
- Built and maintained MLOps infrastructure using Terraform and GitHub Actions, including IAM roles/policies, Databricks model serving endpoints, and Unity Catalog permissions for production ML workloads
- Implemented real-time alerting and monitoring systems for ML inference pipelines, including Databricks alerts with Microsoft Teams notifications and automated non-200 status monitoring
- Developed reusable GitHub Actions for secure AWS Secrets Manager integration, enabling centralized credential management across data engineering repositories with OIDC authentication
- Established CI/CD pipelines and infrastructure-as-code practices for Databricks deployments, including schema management, inference table grants, and service principal permissions
- Contributed technical documentation and architectural designs for the precision agriculture platform, supporting team knowledge sharing and onboarding for ML infrastructure initiatives
Awards:
- 2025 – Quarterly Award: Developed machine learning infrastructure to classify machine state at 250k+ requests per min.
- 2023 – Quarterly Award: Developed machine learning API that enabled legacy machines to benefit from data science initiatives.
- 2022 – Quarterly Award: Developed data pipeline to facilitate safely training ML models with data protected by global privacy laws.
Data Engineer
Team Rubicon (Disaster Response Non-profit) | Mar 2019 – Feb 2021
- Developed analytical pipeline to quantify financial impact of volunteers over 300+ disaster operations; secured $200k donation.
- Leveraged GIS to map and prioritize disaster relief needs by combining CDC Social Vulnerability Index with operational data.
- Designed and developed the entire data analytics platform for the organization with Azure Datalake, Databricks, and Function apps.
- Migrated data analytics platform from Palantir Foundry to Databricks; enabled organization-wide access to more data analytics tools.
- Mentored two junior analysts on Power BI and SQL; doubled data engineering capacity and reduced requests for ad-hoc analyses.
Awards:
- 2021 – Esri Special Achievement in GIS
- 2019 – U.S. President’s Volunteer Service Award with special commitment to Disaster Response Services.
Data Analyst, Developer Advocate (contract)
Kaggle (Data Science Competitions) | Jun 2017 – Mar 2019
- Partnered with the NFL to develop NFL First and Future, a crowd-sourced competition to improve player safety through data science.
- Created a new competition format with non-profit partners, driving an 8% increase in competition launches in 2018.
Software Engineering Intern
Smashrun (Fitness Analytics) | Sept 2016 – Dec 2016
- Developed a data pipeline for processing geojson data into map tiles and generating geospatial visualizations of user running activity.
Bioinformatics Engineering Intern
Adaptive Biotechnologies (Biotech) | May 2016 – Sept 2016
- Developed bioinformatics tool with Python and R to automate chi-square testing of DNA sequencing data for sample independence.
- Automated statistical reporting and generating scientific publication-ready data visualizations using Pandas, Matplotlib, Seaborn.
Aircraft Electrical and Environmental Specialist
United States Air Force | Dec 2005 – Dec 2009
- Trained airmen on aircraft maintenance and safety, achieving 100% success in quality assurance and zero injuries over two years.
- Maintained electrical and avionics systems on C-130 and MQ-9 aircraft, ensuring mission readiness for fleets valued at over $500M.
- Participated in Exercise Balikatan, the annual U.S.-Philippines military exercise focused on counterterrorism operations, humanitarian assistance, and disaster relief, maintaining aircraft readiness for combined training operations.
- Supported Operation Christmas Drop, the world’s longest-running humanitarian airdrop mission, maintaining aircraft systems for Pacific Air Forces training operations delivering essential supplies to remote Pacific island communities.
- Collaborated with Japanese Air Self-Defense Forces on joint maintenance operations, facilitating knowledge exchange and strengthening international partnerships.
Education
Master of Science, Bioinformatics – Northeastern University, Dec 2016
Bachelor of Science, Biology – Pacific Lutheran University, Dec 2013
Associate of Applied Science, Aircraft Maintenance Technology – Community College of the Air Force, May 2008
Interests
- Autonomous Rover: I am building an autonomous rover to deliver homebrew to my neighbors (blog post)
- Homebrewing beer: I have been homebrewing beer for over 15 years and I even brewed beer professionally for a couple of years.
- 3D printing: I use OnShape to create 3D models for the various projects I am working on. I have designed, modeled, and printed tools for homebrewing, biodegradable seed starters, and parts for my autonomous rover (Printables)
- Self-hosting and Homelab: I maintain a 3-node cluster of servers in my basement where I use Proxmox for virtualization and high availability. I host my own servers for Gitea (GitHub alternative), games, VPNs, and various other services.
- Kaggle: I am a Kaggle Grandmaster and was the world’s first Kaggle Datasets Grandmaster (profile)
Skills
Cloud & Infrastructure: AWS, Terraform, Docker, Lambda, S3, API Gateway, ECS, ECR, AWS Secrets Manager, Active Directory
Data & Databases: SQL, PostgreSQL, MySQL, Redis, DynamoDB, Data Lake, Delta Lake, Spark, Scala
ML & Analytics: Databricks, SageMaker, MLFlow, Model Serving, Machine Learning, Python, Pandas
GIS & Geospatial: Esri, ArcGIS, GIS, Shapely
DevOps & Testing: GitHub Actions, Jenkins, Drone, CI/CD, Unit Testing, Integration Testing, Load Testing, Locust
APIs & Monitoring: REST API, OpenAPI, Swagger, Datadog, Dashboards