scientist specializing in data science and machine learning
Providing well-documented code, clear communication, iterative project roadmaps and understandable results
I have over a decade of experience with data modeling and analysis in the fields of data science, machine learning and deep learning.
As an example of my work, please review this paper I co-authored in partnership with Adaptive Biotech and Microsoft, using machine learning to create a better diagnostic for COVID-19:
In working with large datasets, I've become an expert at creating centralized data repositories and at standardizing, integrating and enabling access to data through pipelines, ETL and data lake solutions.
Rather than hiring a team of data engineers, I would be more than happy to help you create a well-organized data pipeline that provides your data team with quicker access to standardized and validated datasets.
One of my data pipeline solutions, built with PySpark and Hadoop, was used to examine pre-term birth in collaboration with the Inova healthcare system:
Teaching and coaching are among my most satisfying endeavors. In my spare time I help write tech manuals, both for projects that I've worked on and for others across the company. I have been handed many projects throughout my career, so I know very well how important it is to ensure that your code is well-documented, optimized and understandable.
In addition to tech writing, I've also been teaching part-time at the University of Washington for the last six years in both the CS department and the data science department.
I have twice won an award from UW for teaching excellence.
My hourly rates are best for smaller projects. If you have a larger project, we can discuss a fixed rate instead.
I will scrub and analyze your data according to your needs, as well as supply you with a Jupyter notebook including results and standard visualizations.
If you would like a reusable pipeline, I can create this for you using Python, R, SQL and/or PySpark. The final product will include a script that parses your data, runs your analysis and outputs standardized visualizations.
I can productionize models that you would like to run on the cloud, as well as set up methods to monitor and update them. You will need access to cloud services that allow for model serving, such as AWS or Databricks. I am happy to help advise on architecture to fit your needs.
After fully understanding your company's infrastructure and goals, I will work with you to create the right architecture to fit your needs. The output will be a presentation of my findings and recommendations that fully explains your options and how various methods may work for your company.
I will create, or work with a team to create, a centralized repository of your datasets. The final product will be an ETL pipeline to ingest, validate, scrub and standardize data into a data lake or warehouse, as well as a method that will allow your team to access the data via script or user interface. Data lake/warehouse costs are not included.
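As a rough illustration of the ingest, validate, scrub and standardize steps described above, here is a minimal Python sketch. The column names, the sample data and the in-memory SQLite database standing in for a data lake or warehouse are all assumptions made for the example. A real pipeline would be designed around your data and platform.

```python
import csv
import io
import sqlite3

# Hypothetical schema: required columns for the example dataset.
REQUIRED = ("patient_id", "visit_date", "weight_kg")


def ingest(csv_text):
    """Read raw CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def validate(row):
    """Return True when every required field is present and weight_kg is numeric."""
    if any(not (row.get(col) or "").strip() for col in REQUIRED):
        return False
    try:
        float(row["weight_kg"])
    except ValueError:
        return False
    return True


def standardize(row):
    """Trim whitespace, upper-case the ID, and cast the weight to float."""
    return (
        row["patient_id"].strip().upper(),
        row["visit_date"].strip(),
        float(row["weight_kg"]),
    )


def run_etl(csv_text, conn):
    """Ingest, validate, standardize and load rows; return (loaded, rejected) counts."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS visits "
        "(patient_id TEXT, visit_date TEXT, weight_kg REAL)"
    )
    rows = ingest(csv_text)
    clean = [standardize(r) for r in rows if validate(r)]
    conn.executemany("INSERT INTO visits VALUES (?, ?, ?)", clean)
    conn.commit()
    return len(clean), len(rows) - len(clean)


# Made-up sample input: one clean row, one non-numeric weight, one missing date.
RAW = """patient_id,visit_date,weight_kg
 a01 ,2021-03-01,70.5
b02,2021-03-02,not_recorded
c03,,64.0
"""

conn = sqlite3.connect(":memory:")  # stand-in for a real warehouse connection
loaded, rejected = run_etl(RAW, conn)
print(loaded, rejected)
```

Keeping each stage as its own small function makes it easier to swap the CSV reader or the SQLite target for whatever source and storage your team actually uses, and the rejected count gives a simple starting point for data-quality reporting.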
If you do not already have policies to standardize, validate and annotate your datasets, I can advise on the implementation of data governance policies. The final product will be a data catalog along with a data policy/security audit and recommendations.
I will provide you with a course syllabus and lesson plans including:
- a PowerPoint presentation for each class
- one hands-on workshop per class
- homework assignments and answer keys
- course setup on Canvas or another learning platform
I love teaching courses in stats, data science, machine learning, Python and R. I am located in Pittsburgh for live courses, and have years of experience with distance learning.
Please contact me to discuss your needs for user manuals, code documentation, training courses and content creation.
References available upon request.
Pittsburgh, Pennsylvania, United States
Summer Rae (412) 572-1547 summer@summerela.com
Open today | 09:00 am – 05:00 pm |
I work from the East Coast, but am happy to accommodate your schedule.