Golden Set Analytics Portfolio

Golden Set Analytics

Big Data, Statistical and Algorithmic Software for ATP & WTA Tennis Matches

End-to-End Analytics Architecture

  • Established a data product architecture with a data warehouse for structured data and a data lake for semi-structured and unstructured data.
  • Integrated relational data sources with unstructured datasets using big data processing technologies.
  • Used semantic modeling and visualization tools for simpler data analysis.
  • Shared datasets within the organization and with trusted external partners.

Usage Analysis

  • Report on how GSA’s video client app is being used.
  • Report on how many times PDFs are viewed.
  • GDPR & CCPA compliant
  • Identity tracking of authenticated users for distinguishing usage by user.
  • Selected Vendors: Google Analytics + Microsoft Clarity

Message Broker + Data Factory + Automation

  • Based on documented events and use cases, updated the architecture diagram for the Live Stats Solution to implement Azure Event Grid for ingesting HawkEye data using a publisher/subscriber model.
  • Reviewed the current ETL implementation that moves data from Azure Databricks to Azure SQL and described options for reducing downtime during pipeline execution.
  • Designed an event-driven architecture so that processes run automatically and immediately (e.g., when a new match is added).
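
The publisher/subscriber idea behind the event-driven design can be illustrated with a minimal in-process sketch. This is not the Azure Event Grid API and not the production implementation; the `EventBus` class, the `match.added` event name, and both handlers are hypothetical names chosen only for illustration.

```python
from collections import defaultdict
from typing import Callable, Dict, List

# A handler receives the event payload as a plain dictionary.
Handler = Callable[[dict], None]


class EventBus:
    """Tiny in-memory publisher/subscriber bus (illustrative only)."""

    def __init__(self) -> None:
        # Maps an event type to the handlers subscribed to it.
        self._subscribers: Dict[str, List[Handler]] = defaultdict(list)

    def subscribe(self, event_type: str, handler: Handler) -> None:
        self._subscribers[event_type].append(handler)

    def publish(self, event_type: str, payload: dict) -> int:
        """Call every subscribed handler; return how many were called."""
        handlers = self._subscribers.get(event_type, [])
        for handler in handlers:
            handler(payload)
        return len(handlers)


# Records what each hypothetical downstream process did.
processed: List[str] = []


def refresh_match_stats(event: dict) -> None:
    # Assumed downstream step: recompute statistics for the new match.
    processed.append(f"stats:{event['match_id']}")


def notify_analysts(event: dict) -> None:
    # Assumed downstream step: tell analysts a new match is available.
    processed.append(f"notify:{event['match_id']}")


bus = EventBus()
bus.subscribe("match.added", refresh_match_stats)
bus.subscribe("match.added", notify_analysts)

# Publishing a single event triggers both processes immediately.
delivered = bus.publish("match.added", {"match_id": "M-001"})
```

The publisher only knows the event type, not who consumes it, so new processes can be attached by subscribing another handler without changing the code that adds a match.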

Feature Flags + Database Versioning

  • Update the video player so that certain features are accessible only to specific logins. One approach is to branch the code and deploy to a separate environment or subdomain. Alternatively, specific features can be switched on or off in the main-branch code and run in a single environment.
  • Get database code (table schemas, stored procedures, etc.) into version control so that different versions of the database can be deployed alongside the feature flags above.
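
The single-environment alternative can be sketched as a per-login flag check that also takes the deployed database version into account. This is a minimal sketch under stated assumptions, not the actual video player code: the `Flag` type, the `is_enabled` function, the feature names, the logins, and the integer schema version are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet


@dataclass(frozen=True)
class Flag:
    # Logins allowed to use the feature; an empty set means everyone.
    logins: FrozenSet[str] = frozenset()
    # Lowest database schema version the feature needs (assumed integer).
    min_schema_version: int = 0


def is_enabled(flags: Dict[str, Flag], feature: str, login: str,
               schema_version: int) -> bool:
    """Return True if the feature is on for this login and schema."""
    flag = flags.get(feature)
    if flag is None:
        # Unknown features are treated as off.
        return False
    if schema_version < flag.min_schema_version:
        # The deployed database is too old for this feature.
        return False
    return not flag.logins or login in flag.logins


# Hypothetical configuration kept in the main branch.
FLAGS = {
    "playback": Flag(),
    "video_filters": Flag(
        logins=frozenset({"coach@example.com"}),
        min_schema_version=3,
    ),
}

coach_sees_filters = is_enabled(FLAGS, "video_filters", "coach@example.com", 3)
guest_sees_filters = is_enabled(FLAGS, "video_filters", "guest@example.com", 3)
old_db_sees_filters = is_enabled(FLAGS, "video_filters", "coach@example.com", 2)
```

Tying each flag to a minimum schema version is one possible way to keep a feature off until the matching database version from version control has been deployed.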

Video Filters

Review the current server-side implementation and transition to a client-side (browser) implementation using FFMPEG + WASM to reduce cloud spend.

Technologies

Draw.io, MermaidJS, Azure, FFMPEG + WASM, .NET / Blazor + C#, PostgreSQL, Dropbox, Python / Jupyter Notebooks, Databricks, Streamlit, Okta, Slack, Bitbucket, Confluence