Matthew Warkentin, Developer in Portland, OR, United States

Matthew Warkentin

Verified Expert in Engineering

Machine Learning Developer

Portland, OR, United States
Toptal Member Since
March 8, 2019

Since 2014, Matthew has worked professionally in the fields he loves, software and data, culminating in co-founding Rubota Corporation in 2017. Before that, he spent a decade at Cornell University conducting scientific research in statistical and biological physics. Matthew is an engaging, intense communicator with a passion for knowledge and understanding.






Preferred Environment

Amazon Web Services (AWS), Linux, Python

The most amazing...

...thing I've done was to co-found Rubota, a supply chain intelligence technology startup.

Work Experience

VP | Data and Analytics

2020 - PRESENT
Toptal Client
  • Built world-class data and analytics functions to handle hundreds of millions of user interactions, covering infrastructure, ETL/warehousing, reporting/dashboards, real-time graph-based recommendations, propensity models, in-app search, and automated QA.
  • Managed over 10x user growth accompanying the launch of a strategic partnership.
  • Supported decision-making for 4 to 6 projects or product features per quarter in engineering, product, marketing, and finance.
Technologies: Data Science, Databases, Analytics, Data Analysis, Business Requirements, Data Pipelines, SQL

Data Scientist (Generalist)

2019 - 2020
Toptal Client
  • Developed an end-to-end marketing data and analytics solution, covering scraping, fusion, predictive modeling, deployment, reporting, and evaluation.
  • Completed studies and proofs of concept for company leadership and regularly advised at that level.
  • Built a hybrid statistical/NLP model for the impact of online reviews.
  • Created a qualitative analysis framework to develop coding schemes for survey responses.
  • Developed a predictive model for the all-in cost of delivery using years of historical data.
  • Built a prototype eCommerce pricing and UX model based on years of historical sales data. The project was later spun out and raised multiple funding rounds.
Technologies: Amazon Web Services (AWS), Matplotlib, Keras, TensorFlow, Scikit-learn, Pandas, SQLAlchemy, SQL, ECS, Python, Pricing Models, PostgreSQL, Data Analysis, Business Requirements, Data Pipelines

Co-founder | Vice President of Data and Analytics

2017 - 2019
Rubota Corporation
  • Collected and integrated data from disparate sources into a unified model.
  • Worked with the chief engineer to develop a platform data model.
  • Integrated in-house and third-party entity analytics.
Technologies: Machine Learning, Python, Data Analysis, Business Requirements, Data Pipelines

Data Scientist

2014 - 2016
Thetus Corporation
  • Produced prototypes and handled third-party integrations.
  • Engaged with customers to understand their data and applications.
  • Supported sales and marketing with demonstrations tailored to target customers.
Technologies: Amazon Web Services (AWS), Python, Data Analysis, Business Requirements, Data Pipelines

Postdoctoral Researcher

2009 - 2014
Cornell University
  • Authored eight peer-reviewed studies in X-ray science, structural biology, and statistical mechanics.
  • Developed novel analytical and visualization tools to investigate protein conformational motions.
  • Managed teams running experiments at Cornell’s X-ray source and Argonne National Lab under extreme time pressure (typically 24 to 48 hours from start to finish).
  • Built and maintained data pipelines to construct 3D models of macromolecules from thousands of X-ray images.
Technologies: Linux, Python, Data Analysis, Data Pipelines

Graduate Research Assistant

2004 - 2009
Cornell University
  • Pioneered experimental techniques to exploit opportunities in the rapidly evolving field of structural biology.
  • Standardized and automated existing data collection and processing practices, greatly increasing the impact of the final product.
Technologies: Linux, Python, Data Analysis, Data Pipelines

Rubota Corporation

At Rubota, a supply chain intelligence technology startup, I served as co-founder and VP of data and analytics. My primary focus was developing unstructured analytics for the integrated supply chain of an enterprise customer.


Languages: SQL, Python, JavaScript

Libraries/APIs: React, Pandas, SQLAlchemy, Scikit-learn, TensorFlow, Keras, Matplotlib

Other: Statistical Analysis, Machine Learning, Statistics, Physics, Modeling, Full-stack, Software Development, Data Analysis, Data Visualization, Data Engineering, Visualization, Business Requirements, ECS, Optimization, Computer Vision, Pricing Models, Analytics

Paradigms: Data Science

Platforms: Amazon Web Services (AWS), Linux

Storage: PostgreSQL, Data Pipelines, Databases

Education


2004 - 2009

PhD Degree in Physics

Cornell University - Ithaca, NY, USA

2000 - 2004

Bachelor of Arts Degree in Physics

UCSC | University of California, Santa Cruz - Santa Cruz, CA, USA
