Published Research

Random Forest Similarity Maps

Random Forest Similarity Maps: A Scalable Visual Representation for Global and Local Interpretation

MDPI Electronics, 2023

This paper explores scalable interpretability techniques for Random Forest models using similarity-based visual representations, enabling both global and local understanding of ensemble decision behavior without sacrificing scalability.

Read on MDPI · PDF


The Data Lakehouse

The Data Lakehouse: Data Warehousing and More

arXiv, 2023

This paper examines the evolution of data architectures leading to the lakehouse paradigm—bridging the gap between traditional data warehouses and data lakes—and discusses key system design considerations and trends.

Read on arXiv · PDF