Vision Model Acceleration
Open-source bounty work bringing up DPT-Large depth estimation on Wormhole N300 accelerators using TTNN, with validation and performance profiling.
Omar Habra • B.S. Computer Engineering • Compilers & AI Systems • Ultramarathon & BJJ
I work across compiler/toolchain engineering and production AI infrastructure: interoperability, performance tuning, and reliability guardrails. Outside of work I train for ultramarathons and BJJ, which keeps me consistent, but the work stands on its own.
Longest race: 100 mi (mountain ultra)
Elevation: +6,000 m total gain
Fastest 100k: 9h 30m
Weekly volume: 80-100 km
Swift/C++ interoperability and LLVM performance

Trail running in the Pacific Northwest
Same endurance mindset as debugging production systems
About / Why work with me
Systems engineer with a B.S. in Computer Engineering, spanning compilers/toolchains and production AI systems.
Built and shipped compiler features at Apple (Swift–C++ interoperability, Apple Silicon bring-up) and SiFive (LLVM/Clang optimization for RISC-V RVV).
In consulting, I design and implement secure multi-tenant LLM products on AWS—retrieval pipelines, evaluation/guardrails, and deployment automation.
Recent open-source work includes TT-Metal TTNN model bring-ups for MaskFormer, DPT-Large depth estimation, and YOLOS-small object detection.
I optimize for correctness, debuggability, and operational reality: tight feedback loops, good tests, and the instrumentation you need when things go sideways.
Comfortable in LLVM/Clang internals and equally comfortable getting things into production: infrastructure as code, CI, and careful rollouts.
I write down decisions, communicate tradeoffs early, and do the unglamorous glue work that keeps teams moving.
Currently
Building and shipping work across toolchains, production AI infrastructure, and accelerator bring-ups—while keeping a steady training habit outside of engineering.
From compiler infrastructure to AI acceleration, each project represents production-grade engineering with real-world results.
Improved performance on a NASA benchmark suite through LLVM vectorization tuning and custom cost models for the RISC-V Vector Extension.
Led the implementation of C++ getter/setter support in the Swift compiler, enabling property-style access across the language boundary. Featured at WWDC 2023.
Swift compiler work shipped in the public toolchain. TT-Metal contributions bring up modern vision models on Tenstorrent accelerators.
Building compilers, toolchains, and production systems on teams pushing the boundaries of what software can do.
AI Infrastructure & Systems Engineer • Seattle, WA
Senior Software Engineer • San Mateo, CA
Software Engineer, Clang/Swift Compiler • Cupertino, CA
From low-level compiler internals to production AI infrastructure, with a focus on performance and reliability.
San José State University
B.S., Computer Engineering • San José, CA
Endurance athletics teaches the same lessons as systems engineering: deliberate practice, measurable progress, and staying calm under pressure.
Seattle Marathon 2024
26.2 miles through downtown Seattle

Full marathon
First marathon finish after four months of structured training.

Purple belt competitor
Multiple matches in a single day, adapting under stress.

Olympic distance
Open water swim, bike, and run.
Open to compiler engineering, AI infrastructure, and systems architecture opportunities. Based in Seattle, WA, remote-friendly.
Ready to discuss your next project?
Whether you need compiler expertise, AI infrastructure, or performance engineering, I'd love to hear about what you're building.