Raúl Díaz García
I am a Senior Data Scientist and Computer Vision Engineer at the Professional Services division of Amazon Web Services. Before joining AWS, I worked as a Research Engineer at HP Inc. My work pivots around the design and deployment of new products related to Computer Vision and Machine Learning. My main job focus involves technologies such as 3D scanning, photogrammetry, metrology, and deep learning on 2D and 3D data.
I was a PhD student at UC Irvine, where I was advised by professor Charless Fowlkes at the Computer Vision group. I explored the rapid growth of social networks and digital mapping data, which offers a rich source of contextual information that can be exploited in practical computer vision systems. My Ph.D. research focused on improving object detection and image segmentation by finding geometric context cues. In particular, I investigated how structure from motion and multi-view stereo can help in the world of scene understanding.
GitHub /
Google Scholar /
LinkedIn /
CV
|
|
|
Raúl Díaz
Strong Geometric Context for Scene Understanding
University of California, Irvine, 2016
thesis /
bibtex /
|
|
Soft Labels for Ordinal Regression
Raúl Díaz, Amit Marathe
Conference on Computer Vision and Pattern Recognition (CVPR), 2019
paper /
poster /
errata /
bibtex /
A general label formulation of naturally ordered categories that seamlessly blends metric losses and categorical losses. In just two lines of code , we improve the state of the art on wildly different tasks.
|
|
Cluster-Wise Ratio Tests for Fast Camera Localization
Raúl Díaz, Charless Fowlkes
Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2017
paper /
poster /
code /
bibtex /
We exploit the negligible cost of exploring the adjacent leaves in kd-tree searches to rapidly cast votes on plausible camera locations. We then back-match these scene landmarks against the query image for high quality PnP solving of the 6DOF camera pose.
|
|
Lifting GIS Maps into Strong Geometric Context for Scene Understanding
Raúl Díaz, Minhaeng Lee, Jochen Schubert, Charless Fowlkes
Winter Conference on Applications of Computer Vision (WACV), 2016
paper /
poster /
bibtex /
We present a Sketchup editing toolbox to turn 2D GIS maps into fully semantic 3D scenes by overlaying geo-registered images. Depth, detection, and segmentation are then improved by injecting geo-semantic features into known specialized algorithms.
|
|
Detecting Dynamic Objects with Multi-View Background Subtraction
Raúl Díaz*, Sam Hallman*, Charless Fowlkes
International Conference on Computer Vision (ICCV), 2013
paper /
supplementary /
poster /
bibtex /
We use Structure from Motion and Multi-View Stereo to unsupervisedly mine hard negatives, and then re-score object detections based on background masks, achieving up to a 50% boost over baselines.
* equal contribution
|
|
Multi-View Background Subtraction for Object Detection
Raúl Díaz*, Sam Hallman*, Charless Fowlkes
Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2013
abstract /
bibtex /
A first introduction into how algorithms that target the modelling of static scenes can help object detectors find non stationary categories.
* equal contribution
|
|
PointNet for Keras
code /
Keras implementation of the PointNet 3D classification network.
- Includes orthogonal regularization of features
- Optimized for fast training (1D convolutions and fast 2D max-pooling)
- Achieves similar rates of original paper (~88%-89% accuracy)
|
|