Reynold S. Xin

I am a PhD student at UC Berkeley in the AMPLab and the Database Group, advised by Michael Franklin. Prior to Berkeley, I worked on AdSense infrastructure at Google and distributed databases at IBM. Before that, I received my Bachelor's degree in Engineering Science from the University of Toronto, advised by Renée Miller.

I enjoy traveling and playing badminton and squash.

Some recent projects:

Shark: Shark is a large-scale data warehouse system for Spark designed to be compatible with Apache Hive. It can answer HiveQL queries up to 30 times faster than Hive without modifying the existing data or queries.

CrowdDB: I led the initial development of CrowdDB, which can answer queries (using crowdsourcing) even if the database does not have the necessary data. The prototype led to a visionary paper in SIGMOD and won the inaugural Best Demo Award at VLDB.

Dataspaces: I worked in Alon Halevy's group at Google Research on data integration at web scale.