Publications

(2024). Fainder: A Fast and Accurate Index for Distribution-Aware Dataset Search. In VLDB 2024.

PDF Cite Code

(2024). SPO-Join: Efficient Stream Inequality Join. In EDBT 2025.

PDF Cite Code

(2024). Reactive Dataflow for Inflight Error Handling in ML Workflows. In DEEM@SIGMOD.

PDF Cite Code

(2023). Apache Wayang: A Unified Data Analytics Framework. In SIGMOD Record.

PDF Code

(2023). XDB in Action: Decentralized Cross-Database Query Processing for Black-Box DBMSes. In PVLDB.

PDF Cite

(2023). P2D: A Transpiler Framework for Optimizing Data Science Pipelines. In DEEM@SIGMOD.

PDF Cite

(2023). In-Situ Cross-Database Query Processing. In ICDE.

PDF Cite

(2022). Navigating Compliance with Data Transfers in Federated Data Processing. In IEEE Data Eng. Bull..

PDF Cite

(2021). Compliant Geo-distributed Data Processing in Action. In PVLDB.

PDF Cite

(2021). Compliant Geo-distributed Query Processing. In SIGMOD.

PDF Cite

(2021). AdCom: Adaptive Combiner for Streaming Aggregations. In EDBT.

PDF Cite Code

(2019). A Unified Framework for Frequent Sequence Mining with Subsequence Constraints. In ACM Trans. Database Syst. (TODS).

PDF Cite Code

(2019). The DESQ Framework for Declarative and Scalable Frequent Sequence Mining. In INFORMATIK.

PDF Cite Code Slides

(2019). Resense: Transparent Record and Replay of Sensor Data in the Internet of Things. In EDBT.

PDF Cite Code

(2015). Closing the Gap: Sequence Mining at Scale. In ACM Trans. Database Syst. (TODS).

PDF Cite Code

(2013). Fully Parallel Inference in Markov Logic Networks. In BTW.

PDF Cite Slides