Francisco Romero

PhD

Heterogeneous Serverless Computing for Interactive Applications

Advisor: Christos Kozyrakis

Active

VIVA is an end-to-end system for interactive video analytics. [Publication (vision)] [Publication (full)] [Preprint]

Completed

Llama is a heterogeneous and serverless framework for auto-tuning video pipelines. [Publication]

Faa$T (also here) is a transparent auto-scaling distributed cache for serverless applications. Faa$T's design is motivated by a comprehensive study of data access patterns in Azure Functions. This project was in collaboration with Microsoft Research. A version of Faa$T is available in production Azure Functions. [Publication]

INFaaS is an inference-as-a-service system that abstracts resource management and model selection. [Publication (full)] [Publication (system vision)] [Publication (scheduler vision)]

gg is a highly-scalable, function-as-a-service orchestration management framework for applications that are normally run on a laptop or on a cluster that can stay idle for long periods of time (e.g., video encoding and compilation). [Publication]


Undergraduate and Masters

Methods for Selecting the Right Storage Technology for Larger-Than-Memory Databases

Advisor: David Cheriton

In-memory databases have strict latency and quality-of-service requirements, and when the database does not fit in memory, performance suffers. We have studied the impacts of paging and context switches on an in-memory databases by using a custom file system kernel module and a multi-threaded b-tree benchmark. Our studies have shown that in order to have a minimal performance degradation when only parts of the b-tree fit in memory, page faults would have to be "close to free". Using low-latency secondary storage technologies, we have developed a useful method that allows service providers to characterize their dataset based on throughput requirements and determine what storage backend technology is most cost-effective to suit their needs. We demonstrate this using Intel's 3DXP and NAND flash, and show that, on average, service providers only need to provision small DRAM amounts across multiple servers to meet their desired Quality-of-Service constraints. [arXiv]


Mage: Online and Interference-Aware Scheduling for Multi-Scale Heterogeneous Systems

Advisor: Christina Delimitrou

Scheduling applications on heterogeneous machines remains challenging, in particular due to the machines sharing various hardware resources. This becomes especially important as servers progressively get replaced and upgraded during a datacenter's lifetime. We developed Mage, a runtime system that leverages online machine learning techniques to determine the performance of co-scheduled applications in a heterogeneous system under any application-to-core mapping. Mage monitors the application performance and quickly reacts to discrepancies between predicted and actual performance within a few milliseconds. [Publication]


Behavioral Signal Processing

Advisor: Shrikanth Narayanan

In psychotherapy, the language used by therapists is critical in influencing the overall session. Thus, being able to predict and measure the level of empathy from a particular session can assist in judging efficacy. Using features inspired by psycholinguistic norms on a corpus of motivational interviews from real patients, we demonstrate the ability to predict empathy with a high UAR. [Publication]

We also leveraged the multiple instance learning (MIL) paradigm for automatically estimating human behavioral patterns and gaining insight into human behavioral data. [Publication]


General Purpose GPU Graph Applications

Advisor: Murali Annavaram

As GPU's become more general purpose, graph applications have emerged as popular candidates for the hardware. However, several bottlenecks and stalls hinder performance on GPUs, in particular at the warp-level. We studied over 30 graph benchmarks and found bottlenecks such as long memory latencies and synchronization barriers to be major factors in the applications' performance degradation. These results aided in developing techniques in resource management and scheduling to diminish the major stalls.


Propagation Characteristics of Heterogeneous Composites of Materials at 60 GHz

Advisor: John Wawrzynek

To construct a high frequency wireless system simulator, we first needed to understand the reflection power and characteristics of various different types of materials. To collect this data, we constructed a mechanical setup onto which two antennas (transmitter and receiver) could be mounted and positioned at different angles from each other. Materials such as a classroom door, composite wall, opaque glass, and a desktop monitor were tested. We found the glass door and the classroom door to closest follow Fresnel's equations, which characterize the interactive properties between an electromagnetic wave and an interface.

This work was done as a part of the 2014 University of California, Berkeley SUPERB-ITS REU. It was presented at the 2014 SACNAS National Conference in Los Angeles, California, where it won a best poster award.





Research Students Mentored

Current

Fabio Ibanez (UG - Stanford CS CURIS)

Caleb Winston (Stanford CS PhD)

Former

Amulya Pillutla (UG - Stanford CS CURIS)

Aditi Partap (Stanford CS PhD)

Ioanna Vavelidou (Stanford EE PhD)

Clemente Farias (UG - Stanford CS CURIS)

Yale Wang (UG - Stanford EE REU)

Daniel Mendoza (Stanford EE MS) - Now PhD at Stanford

Gwen Ray (UG - Stanford CS CURIS)

Collin Kwon (UG - Stanford EE REU)