AI Infrastructure & Data Center Networking

I support hyperscale AI infrastructure across Oracle Cloud (OCI) facilities, maintaining thousands of GPU compute racks and high‑density networks. I specialize in fiber and optical diagnostics, network health automation and rack‑level innovations.

Current Work

  • Support thousands of AI compute racks including NVIDIA H100, GB200, GB300 and AMD MI355X GPU systems across multiple OCI facilities.
  • Maintain and troubleshoot hyperscale networking and fiber infrastructure, acting as the go‑to resource for optical diagnostics and link integrity.
  • Develop internal automation scripts to evaluate switch health, optical power, interface errors and fiber integrity, accelerating break‑fix diagnostics.
  • Performed the first GB200 rack decommission globally during Chaos Lab validation of early production hardware.
  • Assist with the deployment and turn‑up of new AI infrastructure facilities and expansions, including dedicated MI355X and GB200/GB300 buildings.

Professional Experience

Data Center Technician II - Oracle

2025 – Present

  • Support hyperscale AI infrastructure including NVIDIA H100, GB200, GB300 and AMD MI355X GPU clusters across multiple OCI facilities.
  • Maintain and troubleshoot thousands of AI compute and network racks, ensuring uptime across high‑density spine‑leaf networks.
  • Diagnose and repair Arista, NVIDIA, Juniper, Cisco and Opengear networking hardware, including switches, optics and power supplies.
  • Develop automation scripts to evaluate switch health, optical power, interface errors and fiber link integrity for accelerated break‑fix workflows.
  • Performed the first GB200 rack decommission globally during Chaos Lab validation of early production hardware.
  • Assist with the turn‑up of new buildings dedicated to MI355X and GB200/GB300 clusters.

Infrastructure Repair Technician - EOS / Meta

Nov 2024 – Oct 2025

  • Diagnosed and repaired Cisco, Juniper, Arista and other switches, including optics, fans and power supplies.
  • Used fboss2 and proprietary tools to analyze optical loss, trace fiber circuits and verify transceiver health.
  • Coordinated RMA requests and updated asset records across internal ticketing systems.
  • Troubleshot BGP, OSPF and STP pathing errors and resolved fiber loss incidents in live environments.
  • Mentored new technicians, guiding diagnostic methods, tool usage and process workflows.
  • Supported thousands of network devices to meet 99.99% uptime targets at large AI‑enabled data centers.

Data Center Technician - TEKsystems / ZT Systems

Jun 2023 – Oct 2024

  • Installed and maintained NVIDIA DGX and H100 systems, optimizing airflow and power balancing.
  • Led structured cabling layouts and network setups for high‑density GPU racks, resolving hardware and network failures.
  • Supported NVIDIA Spectrum switches and InfiniBand fabrics for H100 clusters, managing MAC/IP configurations and monitoring performance.

Technical Focus

AI Infrastructure

NVIDIA H100, GB200, GB300 and AMD MI355X GPU clusters; high‑density AI rack deployments.

Networking

Arista, Juniper, Cisco & NVIDIA switches; Opengear console servers; spine‑leaf architectures; QSFP28/QSFP‑DD optics; InfiniBand fabrics.

Automation & Scripting

Bash & Python scripts for network and switch health diagnostics, optical power evaluation and break‑fix workflow automation.

Tools & Systems

Linux CLI, fboss2, fcr, Cisco IOS, Junos OS, NVIDIA firmware; asset lifecycle & RMA management.

Protocols & Diagnostics

BGP, OSPF, VLANs, MAC/IP tracing; optical diagnostics, fiber tracing & network health monitoring.

AI Infrastructure I Work With

I work across the entire hyperscale AI training stack, from GPU compute racks to the networking and optical systems that bind them together.

  • GPU compute racks: NVIDIA H100, GB200, GB300 and AMD MI355X systems
  • Spine and leaf switches: Arista, NVIDIA Spectrum/Quantum, Juniper and Cisco platforms
  • High‑density fiber and optical links supporting spine‑leaf network architectures
  • Rack‑level power, cooling and structured cabling for AI training clusters

Projects & Apps

Network Diagnostics Tool

A Bash-based CLI utility that analyzes switch health, optical power levels and fiber link integrity across AI cluster networks, simplifying break‑fix diagnostics for technicians.

  • Evaluates optical RX/TX power, interface errors and link health
  • Automates network health checks across spine and leaf switches
  • Accelerates troubleshooting for technicians of all experience levels
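
To illustrate the kind of check listed above, here is a minimal Python sketch. It is not the tool's own code, which is Bash-based. The thresholds, the InterfaceReading type and the function names are all hypothetical, and real power limits depend on each optic's datasheet.

```python
from dataclasses import dataclass
from typing import Dict, List

# Hypothetical thresholds in dBm; real limits come from the optic's datasheet.
RX_LOW_DBM = -10.0
TX_LOW_DBM = -8.0
MAX_INPUT_ERRORS = 0


@dataclass
class InterfaceReading:
    """One interface's readings, as they might be parsed from switch output."""
    name: str
    rx_dbm: float
    tx_dbm: float
    input_errors: int


def check_interface(reading: InterfaceReading) -> List[str]:
    """Return a list of problems for one interface; empty means healthy."""
    problems = []
    if reading.rx_dbm < RX_LOW_DBM:
        problems.append(f"low RX power ({reading.rx_dbm:.1f} dBm)")
    if reading.tx_dbm < TX_LOW_DBM:
        problems.append(f"low TX power ({reading.tx_dbm:.1f} dBm)")
    if reading.input_errors > MAX_INPUT_ERRORS:
        problems.append(f"{reading.input_errors} input errors")
    return problems


def summarize(readings: List[InterfaceReading]) -> Dict[str, List[str]]:
    """Map each unhealthy interface name to its list of problems."""
    report = {}
    for reading in readings:
        problems = check_interface(reading)
        if problems:
            report[reading.name] = problems
    return report
```

In this sketch, a technician would only see the interfaces that need attention, each with a plain-language reason, which is the idea behind making diagnostics accessible at any experience level.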

BabySphere

BabySphere is an iOS app designed for babies to explore and interact with colors. Built with Swift and Xcode using the MVC architecture, it provides a simple and engaging experience.

  • Fun color exploration for young children
  • Clean architecture for easy maintenance
  • Available on the App Store

Wheel of Meals

Wheel of Meals helps users decide where to eat by randomly selecting from a list of user‑created options. Developed in Swift, it offers a fun way to make dining decisions.

  • Randomly selects dining options
  • Simple, intuitive interface
  • Includes a demo video

Art & Hobbies

Get in Touch

Interested in collaborating or have a question? Let's talk.