ARITH 2024: Program

ARITH 2024 Keynotes

“Report from IEEE WG P3109: Arithmetic Formats for Machine Learning”

Dr Andrew Fitzgibbon, Co-editor P3109 Working Documents

In the field of machine learning, there is increasing interest in and deployment of low-bit-width floating point formats. These formats, typically eight-bit, but increasingly even smaller, have proved valuable in reducing computational requirements while maintaining accuracy and utility of the machine learning algorithms on which our society increasingly depends for security, medicine, and numerous industrial applications in agriculture, entertainment, transport, and many others. The aim of WG P3109 is to synthesize existing practice and to define a set of arithmetic formats which best serve this growing community of practice. This talk will represent the discussions and key ideas and rationales behind the working group's current decisions, as represented in the interim report of September 2023. (See link at https://github.com/P3109/Public/blob/main/Shared%20Reports/P3109%20WG%20Interim%20Report.pdf

Andrew Fitzgibbon has contributed to some of the most significant machine learning and computer vision based products and innovations of the last few decades. At the University of Oxford, he co-founded the company "2d3", which developed the world's first commercially available automated camera tracking system, which was given the Emmy Award for "revolutionary impact on the creation of complex visual effects". At Microsoft, he contributed to the machine learning system behind the human motion capture system in Kinect for Xbox 360, one of the first systems to use massive synthetic training data to deliver a real-world computer vision application. More recently he contributed to the hand tracking algorithms in Microsoft's Hololens, the first commercially available AR headset with fully articulated hand tracking. At Graphcore, his recent work has been on programming languages and molecular machine learning. He is also a highly cited researcher, having authored numerous papers with over a thousand citations, and is inventor on dozens of patents.

“Immense-scale Machine Learning: The Big, the Small, and the Not Right At All”

Norman P. Jouppi, Google VP and Engineering Fellow.

We start with an introduction to the decade-long evolution of Google's Tensor Processing Systems, including the evolution of their numerics over time. We also discuss the unique requirements arising from serving billions of daily users across multiple products in production environments.

Training Large Language ML Models (LLMs) can require 100,000 accelerators working together in synchrony for months. And LLM inference can require exaFLOP/OP speeds with response times of under a second. This has driven adoption of lower-precision formats across the industry. Finally, since random errors from various sources are likely to occur at this immense scale during months-long training runs, fault tolerance is also a key feature of immense-scale ML systems. All these factors provide interesting challenges for computer arithmetic.

Norman P. Jouppi is a Google VP and Engineering Fellow. He joined Google in 2013 to lead the design of Google's Tensor Processing Units (TPUs). He is known for his innovations in computer memory systems, and was the principal architect and lead designer of several microprocessors. His innovations in microprocessor design have been adopted in many high-performance microprocessors.

Norm received his Ph.D. in electrical engineering from Stanford University in 1984. While at Stanford he was one of the principal architects and designers of the MIPS microprocessor, and developed techniques for MOS VLSI timing verification. He joined HP in 2002 through its merger with Compaq, where he was previously a Staff Fellow at Compaq's Western Research Laboratory (formerly DECWRL) in Palo Alto, California. In 2010 he was named an HP Senior Fellow. From 1984 through 1996 he was a consulting assistant/associate professor in the electrical engineering department at Stanford University where he taught courses in computer architecture, VLSI, and circuit design.

Norm holds more than 125 U.S. patents. He has published over 125 technical papers, with several best paper awards and two International Symposium on Computer Architecture (ISCA) Influential Paper Awards. He is the recipient of the 2014 IEEE Harry H. Goode Award and the 2015 ACM/IEEE Eckert-Mauchly Award. He is a Fellow of the ACM, IEEE, and AAAS, and a member of the National Academy of Engineering.

Symposium at a glance

Time (CEST)	Sunday, June 9th	Monday, June 10th	Tuesday, June 11th	Wednesday, June 12th
8:30		Registration	Registration
		Registration	Registration
9:00		Welcome	Session 4 Math Tools, Libraries and Software Evaluation	Session 6 Arithmetic Operators
9:30		Session 1 Arithmetic for Cryptography
......
......
11:00		Coffee Break	Coffee Break	Coffee Break
11:45		Keynote Talk Andrew Fitzgibbon	Keynote Talk Norman P. Jouppi	Session 7 Alternative formats
......		Keynote Talk Andrew Fitzgibbon	Keynote Talk Norman P. Jouppi	Session 7 Alternative formats
12:45		Lunch	Lunch	Close
......				Lunch
......				Lunch
14:15		Session 2 Datapath Design I	Session 5 Transcendental functions and error analysis
......
......
15:45		Coffee Break	Coffee Break	Malaga Cathedral + Alcazaba Tour
16:30		Session 3 Datapath Design II	Steering Committee Meeting (SC Members only)
......		Session 3 Datapath Design II
17:30
......
......
......
19:00
19:15			Reception at the Málaga City Hall
20:00	Welcome Reception	Visit to the historic center of Málaga (dinner included after tour)	Banquet (Kaleido Malaga Port)	Catamaran Experience
......
......

Practical information

All coffee breaks and lunches are included in the registration fee, as well as Welcome Reception and Banquet.

Dinner on Monday is included for attendants who booked the guided tour through the city center.

Dinner on Wednesday is not covered by the conference.

More information on the social program can be found in the dedicated page.

Disclaimer: The opinions expressed in the papers on this website are the opinions of the authors/speakers and not necessarily the opinions of the IEEE or of the conference and its organizers.

Detailed Program — Subject to Change

All times are displayed in Central European Summer Time (UTC+02:00)

Monday June 10th, 2024

08:30 - 09:00 Registration

09:00 - 09:30 Welcome

09:30 - 11:00 Session 1: Arithmetic for Cryptography (chair: Leonel Sousa)

		Hardware Acceleration of the Prime-Factor and Rader NTT for BGV Fully Homomorphic Encryption David Du Pont (KU Leuven, Belgium), Jonas Bertels, Furkan Turan, Michiel Van Beirendonck and Ingrid Verbauwhede (COSIC KU Leuven, Belgium)
		PQC-AMX: accelerating Saber and FrodoKEM on the Apple M1 and M3 SoCs Décio Luiz Gazzoni Filho (UNICAMP, Brazil), Guilherme Brandão (Independent Researcher, Londrina, Brazil), Gora Adj, Arwa Alblooshi, Isaac Canales-Martínez and Jorge Chávez-Saab (Cryptography Research Centre, UAE) and Julio López (UNICAMP, Brazil)
		Montgomery Modular Multiplication via Single-Base Residue Number Systems Zabihollah Ahmadpour (Shahid Beheshti University, Iran), Ghassem Jaberipur and Jeong-A Lee (Chosun University, South Korea)

11:00 - 11:45 Coffee break

11:45 - 12:45 Keynote Talk

Report from IEEE WG P3109: Arithmetic Formats for Machine Learning
Andrew Fitzgibbon (Graphcore, UK )

12:45 - 14:15 Lunch

14:15 - 15:45 Session 2: Datapath Design I (chair: Mioara Joldes)

Combining Power and Arithmetic Optimization via Datapath Rewriting
Samuel Coward (Intel, UK), Theo Drane and Emiliano Morini (Intel, USA) and George Constantinides (Imperial College London, UK)

Useful applications of correctly-rounded operators of the form ab+cd+e
Tom Hubrecht (Ecole Normale Supérieure, Paris, France), Claude-Pierre Jeannerod (Inria, LIP) and Jean-Michel Muller (CNRS)

Fused FP8 4-Way Dot Product with Scaling and FP32 Accumulation
David Lutz and Anisha Saini (Arm, USA), Mairin Kroes (Arm, UK), Thomas Elmer and Harsha Valsaraju (Arm, USA)

15:45 - 16:30 Coffee break

16:30 - 17:30 Session 3: Datapath Design II (chair: Bogdan Pasca)

		Multiple-base Logarithmic Quantization and Application in Reduced Precision AI Computations Vassil Dimitrov (University of Calgary & Lemurian Labs, Canada), Richard Ford (Lemurian Labs, USA), Laurent Imbert (CNRS, University of Montpellier & Lemurian Labs, France), Arjuna Madanayake (Lemurian Labs, USA) , Nilan Udayanga (Lemurian Labs, Sri Lanka) and Will Wray (Lemurian Labs, Canada)
		Novel Access Patterns based on Overlapping Loading and Processing Times to Reduce Latency and Increase Throughput in Memory-based FFTs Zeynep Kaya (Bilecik Seyh Edebali University, Turkey) and Mario Garrido (Polytechnic University of Madrid, Spain)

20:00 - 21:00 Visit to the historic center of Málaga (dinner included after tour).

Tuesday June 11th, 2024

08:30 - 09:00 Registration

09:00 - 11:00 Session 4: Math Tools, Libraries and Software Evaluation (chair: Theo Drane)

		An Open-Source RISC-V Vector Math Library Ping Tang (Rivos Inc, USA)
		MATLAB Simulator of Level-Index Arithmetic Mantas Mikaitis (University of Leeds, UK)
		APyTypes: Algorithmic Data Types in Python for Efficient Simulation of Finite Word-Length Effects Mikael Henriksson, Theodor Lindberg and Oscar Gustafsson (Linköping University, Sweden)
		An Emacs-Cairo Scrolling Bug due to Floating-Point Inaccuracy Vincent Lefèvre (INRIA, France)

11:00 - 11:45 Coffee break

11:45 - 12:45 Keynote Talk

Immense-scale Machine Learning: The Big, the Small, and the Not Right At All
Norman P. Jouppi (Google, USA)

12:45 - 14:15 Lunch

14:15 - 15:45 Session 5: Transcendental functions and error analysis (chair: Mantas Mikaitis)

		Fast multiple precision exp(x) with precomputations Joris van der Hoeven (CNRS, France) and Fredrik Johansson (INRIA Bordeaux, France)
		HGH-CORDIC: A High-Radix Generalized Hyperbolic Coordinate Rotation Digital Computer Hui Chen, Lianghua Quan and Weiqiang Liu (Nanjing University, China)
		Rounding Error Analysis of an Orbital Collision Probability Evaluation Algorithm Denis Arzelier, Florent Bréhard, Mioara Joldes and Marc Mezzarobba (CNRS, France)

15:45 - 16:30 Coffee break

19:15 - 20:00 Reception by the Mayor at the Málaga City Hall

20:00 - 22:00 Banquet (Kaleido Málaga Port)

Wednesday June 12th, 2024

09:00 - 11:00 Session 6: Arithmetic Operators (chair: Christoph Lauter)

		Multiplier Architecture with a Carry-Based Partial Product Encoding Martin Langhammer (Intel, UK), Bogdan Pasca (Intel, France) and Igor Kucherenko (Intel, USA)
		On the Systematic Creation of Faithfully Rounded Commutative Truncated Booth Multipliers Theo Drane (Intel, USA) , Samuel Coward (Intel, UK) and Mertcan Temel, Joe Leslie-Hurd (Intel, USA)
		A Time Efficient Comprehensive Model of Approximate Multipliers for Design Space Exploration Ziying Cui, Ke Chen, Bi Wu, Chenggang Yan, Yu Gong and Weiqiang Liu (Nanjing University of Aeronautics and Astronautics, China)
		Small Logic-based Multipliers with Incomplete Sub-Multipliers for FPGAs Andreas Boettcher and Martin Kumm (Fulda University of Applied Sciences, Germany)

11:00 - 11:45 Coffee break

11:45 - 12:45 Session 7: Alternative formats (chair: Florent de Dinechin)

		Square Root Unit with Minimum Iterations for Posit Arithmetic Raul Murillo, Alberto A. Del Barrio and Guillermo Botella (Complutense University of Madrid, Spain)
		PT-Float: A Floating-Point Unit with Dynamically Varying Exponent and Fraction Sizes Micaela Serôdio, João Lopes, José Sousa, Horácio Neto and Mário Véstias (University of Lisbon, Portugal)

12:45 - 13:00 Close

13:00 - 14:15 Lunch

16:00 - 19:00 Sightseeing tour of Málaga (Cathedral and Alcazaba)

20:00 - 21:30 Catamaran Experience

ARITH 2024 ACCEPTED PAPERS

ARITH 2024 Accepted Papers with Abstracts

31^st IEEE International Symposium on Computer Arithmetic
ARITH 2024

Málaga, Spain. June 10-12, 2024.

ARITH 2024 Keynotes

“Report from IEEE WG P3109: Arithmetic Formats for Machine Learning”

“Immense-scale Machine Learning: The Big, the Small, and the Not Right At All”

Symposium at a glance

Practical information