CH Charles Hayes
Menu

SRE · AI Builder · Incident Response

I turn production noise into signal.
AI drives clarity.

Senior SRE at LPL Financial. I combine production-incident expertise with AI-fluent ways of working — building practical AI tools for observability, incident analysis, and decision support.

Faster Resolution Reduced Risk Operational Clarity
Logs Metrics Traces Incidents AI Insights
Featured Projects
Stock Narrative Explorer screenshot

Stock Narrative Explorer

Interactive equity-narrative tool that generates confidence-tiered, day-by-day AI explanations for stock movements (NVDA, MSFT, TSLA, LPLA), combining yfinance, Finnhub, SEC EDGAR, and Google News with daily drift detection.

Key Capabilities

  • Daily AI narratives for NVDA, MSFT, TSLA, LPLA
  • 4-source ingestion (yfinance, Finnhub, SEC EDGAR, Google News)
  • Confidence-tiered output with structured citations
  • Drift detection re-evaluates stale narratives
Python FastAPI Anthropic API React/Vite Docker Railway
Inside LPL Internal at LPL Financial

Performance Impact Analyzer

Quantifies advisor and investor experience degradation during major incidents.

Observability Metrics Framework

Used in monthly CIO-level platform stability reviews.

NLP Incident Analysis

Recurring root-cause theme detection across major-incident records.

Agentic AI Strategy & Onboarding Playbook

Adopted across the SRE team for responsible AI-assisted ways of working.

What I Do

Reliability Engineering

Designing observability frameworks and post-incident programs that hold up at enterprise scale.

Incident Analysis

Turning messy production signals into clear root-cause narratives — manual rigor with AI acceleration.

AI & Automation

Building practical AI tools (Claude Code, Anthropic API) that replace toil and surface decisions.

Communication & Leadership

Translating multi-domain technical failures into business narratives senior leadership can act on.

Experience
Sep 2025 – Present

Senior Systems Engineer, SRE Insights

LPL Financial

  • · Built observability metrics framework now used in monthly CIO-level platform stability reviews
  • · Built AI-assisted Performance Impact Analyzer (Claude Code, Python, Dynatrace) — replaced 10+ hours/week of manual analysis
  • · Applied NLP analysis across major-incident records to surface recurring root-cause themes
  • · Delivered a six-week Apdex deep-dive across 100+ advisor offices
  • · Authored agentic AI tool strategy and onboarding playbook adopted across the team
Observability Dashboards Python Automation
Sep 2022 – Sep 2025

Senior Systems Engineer, Problem Management

LPL Financial

  • · Directed 320 major problem investigations including 70 Critical/High Priority incidents
  • · Led LPL response coordination during the global CrowdStrike outage (Jul 2024)
  • · Spearheaded "Guardians of the Uptime" — secured EPMO funding based on demonstrated business impact
  • · Modernized Problem Management Confluence knowledge base; standardized Five-Why cause maps
  • · ProductTech Monthly Value Award (Sep 2024) — "Seek, Embrace, and Apply Feedback"
Incident Management RCA Service Improvement Cross-functional Coordination

Earlier: Tech Intern, Infrastructure Change Management at LPL (2020) · IT End User Services Intern at TriNet (2019)

Education & Certifications

B.S. Computer Science

Clemson University Honors College

May 2021

Cum Laude · Upsilon Pi Epsilon · General Honors

Certifications

AWS Cloud Practitioner (2021, renewed 2024) ITIL®4 Foundation (2022) ITIL®4 Practitioner: Problem Management (2024) Lean Six Sigma Yellow Belt (2023) McKinsey Academy Business Leadership (2025) Agile Foundations (2021)
Skills

AI & Automation

Claude Code Cursor GitHub Copilot Anthropic API Prompt Engineering Agentic Tool Design NLP-based Incident Analysis

Reliability & Operations

Site Reliability Engineering Incident Management Major Problem Management Root Cause Analysis Five-Why Cause Mapping ITIL®4

Change & Enablement

Change Management Cross-functional Coordination Knowledge Base Modernization Mentorship Executive Communications

Observability & Tooling

Dynatrace AlertSite ServiceNow Jira/Confluence AWS SharePoint MS365

Analytics & Reporting

Performance Analysis Apdex Metrics Frameworks Data Storytelling Business Impact Translation

Let's build systems that learn,
adapt, and create real impact.

Open to collaborations and opportunities. Charlotte, NC.