Luca de Alfaro: Projects

Luca de Alfaro

Professor

luca@ucsc.edu

Computer Science and Engineering

University of California, Santa Cruz

Ph.D. Stanford University, 1998

Office: Bld E2, Rm 339A

Programming in Natural Language

Nowadays, people code mostly as follows:

they write beautiful and precise descriptions in English (or other languagges) of what should be done,
they ask AI to produce code,
... and ... they throw out the natural language descriptions!

This is a shame. It is as if someone wrote very carefully a C++ program, compiled it, and then once the binary works, threw out the C++ code because "the binary works". The problem with this approach is that the natural language description can be so much more useful, as everybody, and not only programmers, can read and modify them!

We aim to answer the question: what would it take for natural languages to replace programming languages? What do we need to give people the confidence that their natural language descriptions are correctly implemented? What do we need to replace the programming-language ecosystem with a natural-language one? We have various projects in this area. They include these (we will add more as we go along).

Plainbook

This project aims to develop notebooks like the Jupyter ones, except, in natural language. AI is used to convert the natural language to code and to check that the code correctly implements the natural language. Multiple AIs can be used, so that one does not need to trust a single one.

Fairness in Machine Learning

The goal of this project is to develop the theory and the tools that enable people, from data scientists to non-specialists, to analyze datasets, and identify anomalous behavior that has potential fairness implications. This is joint work with Elena Baralis and Eliana Pastor.

We have just released DivExplorer: a Python package, and an associated Web Application, that enables anyone to analyze a dataset, and identify data subgroups where a classifier behaves in a way that deviates from the average. This can be used, for instance, to identify who benefits, and who is disadvantages, from errors in machine-learning classifiers. We are looking for contributors!

Short Video

Project Site

Python Package

Web App

Selected Publications

Prioritizing Data Acquisition For End-To-End Speech Model Improvement. A. Kodounas, E. Pastor, G. Attanasio, L. de Alfaro, E. Baralis. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024.

Leveraging Confidence Model For Identifying Challenging data Subgroups in Speech Models. A. Kodounas, E. Pastor, V. Mazzia, M. Giollo, T. Gueudre, E. Reale, G. Attanasio, L. Cagliero, S. Cumani, L. de Alfaro, E. Baralis, D. Amberti. In Proceedings of IEEE ICASSP Workshop on Trustworthy Speech Processing, 2024.

Towards Comprehensive Subgroup Performance Analysis in Speech Models. A. Kodounas, E. Pastor, G. Attanasio, V. Mazzia, M. Giollo, T. Gueudre, E. Reale, L. Cagliero, L. de Alfaro, E. Baralis, D. Amberti. IEEE Transactions on Audio, Speech, and Language Processing, 32(1468-1480), 2024. Supplementary material.

Exploring Subgroup Performance in End-To-End Speech Models. A. Kodounas, E. Pastor, G. Attanasio, V. Mazzia, M. Giollo, T. Gueudre, L. Cagliero, L. de Alfaro, E. Baralis, D. Amberti. In Proceedings of the International Conference on Acoustings, Speech, and Signal Processing (ICASSP), 2023.

A Hierarchical Approach to Anomalous Subgroup Discovery. E. Pastor, E. Baralis, L. de Alfaro. In Proceedings of the 39th IEEE International Conference on Data Engineering (ICDE), 2023.

Identifying Biased Subgroups in Ranking and Classification. E. Pastor, L. de Alfaro, E. Baralis. In Proceedings of the Responsible AI @ KDD 2021 Workshop, 2021.

How Divergent Is Your Data? E. Pastor, A. Gavgavian, E. Baralis, L. de Alfaro. In Proceedings of the 47th International Conference on Very Large Data Bases (VLDB), Demo Track, 2021.

Looking for Trouble: Analyzing Classifier Behavior via Pattern Divergence. E. Pastor, L. de Alfaro, E. Baralis. In Proceedings of the 2021 ACM SIGMOD Conference, 2021.

Computational Ecology

Natalia Ocampo-Peñuela and I are starting a project at the intersection of computer science and ecology. The project will apply the methods of neural computation, ML, optimization, and parallel processing to the mapping and analysis of animal distributions, and in particular, of birds. We want to develop computational tools to analyze habitat changes, such as urbaninzation and natural disasters, and to propose interventions such as habitat protections that benefit species survival. This project is just starting, and we are looking for students who might be interested in joining it. In particular, we welcome applications from prospective Ph.D. students. Contact us if you are interested.

NotebookGrader

NotebookGrader is an open source web app for assigning and grading Python notebooks. Instructors can create assignments consisting of Python notebooks. When students join the assignments, they are shared a notebook on Google Colab on which they can work. They have a grade button; when they press it, they get both a grade, and feedback. The code can be found on GitHub. NotebookGrader has been written to facilitate teaching at UC Santa Cruz, and to be a vehicle for experimentation in innovative techniques in teaching and education.

Reinforcement learning for networks

In current network protocols, much of the behavior is hardwired in rules that prescribe when to transmit packets, how frequently to send them, how to route them, and so forth. The project seeks to leverage reinforcement learning to obtain protocols that adapt to their environment and achieve both greater efficiency, and greater robustness via their adaptation capability. We have developed techniques for channel access in wireless networks that, by using reinforcement learning, enable network nodes to co-adapt and collaborate to achieve fair and efficient bandwidth sharing. We are now looking at channel access, routing, congestion control, and more.

Publications

Adaptive Policy Tree Algorithm to Approach Collision-Free Transmissions in Slotted ALOHA. M. Zhang, L. de Alfaro, M. Mosko, C. Funai, T. Upthegrove, B. Thapa, D. Javorsek, J.J. Garcia-Luna-Aceves. In Proceedings of the 17th International Conference on Mobile Ad-Hoc and Smart Systems (IEEE MASS), 2020.

Using Reinforcement Learning in Slotted Aloha for Ad-Hoc Networks. M. Zhang, L. de Alfaro, JJ. Garcia-Luna-Aceves. In Proceedings of the 23rd International ACM Conference on Modeling, Analysis, and Simulation of Wireless and Mobile Systems (MSWiM), pages 245-252, 2020.

An Adaptive Tree Algorithm to Approach Collision-Free Transmission in Slotted ALOHA. M. Zhang, L. de Alfaro, J.J. Garcia-Luna-Aceves. In Proceedings of the ACM SIGCOMM 2020 Workshop on Network Meets AI & ML (NetAI), 2020.

Approaching Fair Collision-Free Channel Access with Slotted ALOHA Using Collaborative Policy-Based Reinforcement Learning. L. de Alfaro, M. Zhang, J.J. Garcia-Luna-Aceves. In Proceedings of the IFIP Networking 2020 Conference, 2020.