ARITH 2024 Keynotes

“Report from IEEE WG P3109: Arithmetic Formats for Machine Learning”

Dr Andrew Fitzgibbon, Co-editor P3109 Working Documents

In the field of machine learning, there is increasing interest in and deployment of low-bit-width floating point formats. These formats, typically eight-bit, but increasingly even smaller, have proved valuable in reducing computational requirements while maintaining accuracy and utility of the machine learning algorithms on which our society increasingly depends for security, medicine, and numerous industrial applications in agriculture, entertainment, transport, and many others. The aim of WG P3109 is to synthesize existing practice and to define a set of arithmetic formats which best serve this growing community of practice. This talk will represent the discussions and key ideas and rationales behind the working group's current decisions, as represented in the interim report of September 2023. (See link at https://github.com/P3109/Public/blob/main/Shared%20Reports/P3109%20WG%20Interim%20Report.pdf

 

 

 

Andrew Fitzgibbon has contributed to some of the most significant machine learning and computer vision based products and innovations of the last few decades. At the University of Oxford, he co-founded the company "2d3", which developed the world's first commercially available automated camera tracking system, which was given the Emmy Award for "revolutionary impact on the creation of complex visual effects". At Microsoft, he contributed to the machine learning system behind the human motion capture system in Kinect for Xbox 360, one of the first systems to use massive synthetic training data to deliver a real-world computer vision application. More recently he contributed to the hand tracking algorithms in Microsoft's Hololens, the first commercially available AR headset with fully articulated hand tracking. At Graphcore, his recent work has been on programming languages and molecular machine learning. He is also a highly cited researcher, having authored numerous papers with over a thousand citations, and is inventor on dozens of patents.

 

“Immense-scale Machine Learning: The Big, the Small, and the Not Right At All”

Norman P. Jouppi, Google VP and Engineering Fellow.

We start with an introduction to the decade-long evolution of Google's Tensor Processing Systems, including the evolution of their numerics over time. We also discuss the unique requirements arising from serving billions of daily users across multiple products in production environments.

Training Large Language ML Models (LLMs) can require 100,000 accelerators working together in synchrony for months. And LLM inference can require exaFLOP/OP speeds with response times of under a second. This has driven adoption of lower-precision formats across the industry. Finally, since random errors from various sources are likely to occur at this immense scale during months-long training runs, fault tolerance is also a key feature of immense-scale ML systems. All these factors provide interesting challenges for computer arithmetic.

 

 

 

Norman P. Jouppi is a Google VP and Engineering Fellow. He joined Google in 2013 to lead the design of Google's Tensor Processing Units (TPUs). He is known for his innovations in computer memory systems, and was the principal architect and lead designer of several microprocessors. His innovations in microprocessor design have been adopted in many high-performance microprocessors.

Norm received his Ph.D. in electrical engineering from Stanford University in 1984. While at Stanford he was one of the principal architects and designers of the MIPS microprocessor, and developed techniques for MOS VLSI timing verification. He joined HP in 2002 through its merger with Compaq, where he was previously a Staff Fellow at Compaq's Western Research Laboratory (formerly DECWRL) in Palo Alto, California. In 2010 he was named an HP Senior Fellow. From 1984 through 1996 he was a consulting assistant/associate professor in the electrical engineering department at Stanford University where he taught courses in computer architecture, VLSI, and circuit design.

Norm holds more than 125 U.S. patents. He has published over 125 technical papers, with several best paper awards and two International Symposium on Computer Architecture (ISCA) Influential Paper Awards. He is the recipient of the 2014 IEEE Harry H. Goode Award and the 2015 ACM/IEEE Eckert-Mauchly Award. He is a Fellow of the ACM, IEEE, and AAAS, and a member of the National Academy of Engineering.

 

Symposium at a glance

Time
(CEST)
Sunday,
June 9th
Monday, June 10th Tuesday, June 11th Wednesday, June 12th
8:30 Registration Registration
9:00 Welcome
Session 4
Math Tools, Libraries and Software Evaluation
Session 6
Arithmetic Operators
9:30 Session 1
Arithmetic for Cryptography
......
......
11:00 Coffee Break Coffee Break Coffee Break
11:45 Keynote Talk
Andrew Fitzgibbon
Keynote Talk
Norman P. Jouppi
Session 7
Alternative formats
......
12:45 Lunch Lunch Close
...... Lunch
......
14:15 Session 2
Datapath Design I
Session 5
Transcendental functions and error analysis
......
......
15:45 Coffee Break Coffee Break Malaga Cathedral + Alcazaba Tour
16:30 Session 3
Datapath Design II
Steering Committee Meeting
(SC Members only)
......
17:30
......
......
......
19:00
19:15 Reception at the Málaga City Hall
20:00 Welcome Reception Visit to the historic center of Málaga (dinner included after tour) Banquet
(Kaleido Malaga Port)
Catamaran Experience
......
......

Practical information

All coffee breaks and lunches are included in the registration fee, as well as Welcome Reception and Banquet.

Dinner on Monday is included for attendants who booked the guided tour through the city center.

Dinner on Wednesday is not covered by the conference.

More information on the social program can be found in the dedicated page.

Disclaimer: The opinions expressed in the papers on this website are the opinions of the authors/speakers and not necessarily the opinions of the IEEE or of the conference and its organizers.


Detailed Program — Subject to Change

All times are displayed in Central European Summer Time (UTC+02:00)

Monday June 10th, 2024

08:30 - 09:00 Registration

09:00 - 09:30 Welcome

09:30 - 11:00Session 1: Arithmetic for Cryptography (chair: Leonel Sousa)

pdf pdf Hardware Acceleration of the Prime-Factor and Rader NTT for BGV Fully Homomorphic Encryption
David Du Pont (KU Leuven, Belgium), Jonas Bertels, Furkan Turan, Michiel Van Beirendonck and Ingrid Verbauwhede (COSIC KU Leuven, Belgium)
pdf pdf PQC-AMX: accelerating Saber and FrodoKEM on the Apple M1 and M3 SoCs
Décio Luiz Gazzoni Filho (UNICAMP, Brazil), Guilherme Brandão (Independent Researcher, Londrina, Brazil), Gora Adj, Arwa Alblooshi, Isaac Canales-Martínez and Jorge Chávez-Saab (Cryptography Research Centre, UAE) and Julio López (UNICAMP, Brazil)
pdf pdf Montgomery Modular Multiplication via Single-Base Residue Number Systems
Zabihollah Ahmadpour (Shahid Beheshti University, Iran), Ghassem Jaberipur and Jeong-A Lee (Chosun University, South Korea)

11:00 - 11:45 Coffee break

11:45 - 12:45Keynote Talk

pdf Report from IEEE WG P3109: Arithmetic Formats for Machine Learning
Andrew Fitzgibbon (Graphcore, UK )

12:45 - 14:15 Lunch

14:15 - 15:45Session 2: Datapath Design I (chair: Mioara Joldes)

pdf pdf Combining Power and Arithmetic Optimization via Datapath Rewriting
Samuel Coward (Intel, UK), Theo Drane and Emiliano Morini (Intel, USA) and George Constantinides (Imperial College London, UK)
pdf pdf
Best paper award Useful applications of correctly-rounded operators of the form ab+cd+e
Tom Hubrecht (Ecole Normale Supérieure, Paris, France), Claude-Pierre Jeannerod (Inria, LIP) and Jean-Michel Muller (CNRS)
pdf pdf Fused FP8 4-Way Dot Product with Scaling and FP32 Accumulation
David Lutz and Anisha Saini (Arm, USA), Mairin Kroes (Arm, UK), Thomas Elmer and Harsha Valsaraju (Arm, USA)

15:45 - 16:30 Coffee break

16:30 - 17:30Session 3: Datapath Design II (chair: Bogdan Pasca)

pdf pdf Multiple-base Logarithmic Quantization and Application in Reduced Precision AI Computations
Vassil Dimitrov (University of Calgary & Lemurian Labs, Canada), Richard Ford (Lemurian Labs, USA), Laurent Imbert (CNRS, University of Montpellier & Lemurian Labs, France), Arjuna Madanayake (Lemurian Labs, USA) , Nilan Udayanga (Lemurian Labs, Sri Lanka) and Will Wray (Lemurian Labs, Canada)
pdf pdf Novel Access Patterns based on Overlapping Loading and Processing Times to Reduce Latency and Increase Throughput in Memory-based FFTs
Zeynep Kaya (Bilecik Seyh Edebali University, Turkey) and Mario Garrido (Polytechnic University of Madrid, Spain)

20:00 - 21:00 Visit to the historic center of Málaga (dinner included after tour).


Tuesday June 11th, 2024

08:30 - 09:00 Registration

09:00 - 11:00Session 4: Math Tools, Libraries and Software Evaluation (chair: Theo Drane)

pdf pdf An Open-Source RISC-V Vector Math Library
Ping Tang (Rivos Inc, USA)
pdf pdf MATLAB Simulator of Level-Index Arithmetic
Mantas Mikaitis (University of Leeds, UK)
pdf pdf APyTypes: Algorithmic Data Types in Python for Efficient Simulation of Finite Word-Length Effects
Mikael Henriksson, Theodor Lindberg and Oscar Gustafsson (Linköping University, Sweden)
pdf pdf An Emacs-Cairo Scrolling Bug due to Floating-Point Inaccuracy
Vincent Lefèvre (INRIA, France)

11:00 - 11:45 Coffee break

11:45 - 12:45Keynote Talk

pdf Immense-scale Machine Learning: The Big, the Small, and the Not Right At All
Norman P. Jouppi (Google, USA)

12:45 - 14:15 Lunch

14:15 - 15:45Session 5: Transcendental functions and error analysis (chair: Mantas Mikaitis)

pdf pdf Fast multiple precision exp(x) with precomputations
Joris van der Hoeven (CNRS, France) and Fredrik Johansson (INRIA Bordeaux, France)
pdf pdf HGH-CORDIC: A High-Radix Generalized Hyperbolic Coordinate Rotation Digital Computer
Hui Chen, Lianghua Quan and Weiqiang Liu (Nanjing University, China)
pdf pdf Rounding Error Analysis of an Orbital Collision Probability Evaluation Algorithm
Denis Arzelier, Florent Bréhard, Mioara Joldes and Marc Mezzarobba (CNRS, France)

15:45 - 16:30 Coffee break

19:15 - 20:00 Reception by the Mayor at the Málaga City Hall

20:00 - 22:00 Banquet (Kaleido Málaga Port)


Wednesday June 12th, 2024

09:00 - 11:00Session 6: Arithmetic Operators (chair: Christoph Lauter)

pdf pdf Multiplier Architecture with a Carry-Based Partial Product Encoding
Martin Langhammer (Intel, UK), Bogdan Pasca (Intel, France) and Igor Kucherenko (Intel, USA)
pdf pdf On the Systematic Creation of Faithfully Rounded Commutative Truncated Booth Multipliers
Theo Drane (Intel, USA) , Samuel Coward (Intel, UK) and Mertcan Temel, Joe Leslie-Hurd (Intel, USA)
pdf pdf A Time Efficient Comprehensive Model of Approximate Multipliers for Design Space Exploration
Ziying Cui, Ke Chen, Bi Wu, Chenggang Yan, Yu Gong and Weiqiang Liu (Nanjing University of Aeronautics and Astronautics, China)
pdf pdf Small Logic-based Multipliers with Incomplete Sub-Multipliers for FPGAs
Andreas Boettcher and Martin Kumm (Fulda University of Applied Sciences, Germany)

11:00 - 11:45 Coffee break

11:45 - 12:45Session 7: Alternative formats (chair: Florent de Dinechin)

pdf pdf Square Root Unit with Minimum Iterations for Posit Arithmetic
Raul Murillo, Alberto A. Del Barrio and Guillermo Botella (Complutense University of Madrid, Spain)
pdf pdf PT-Float: A Floating-Point Unit with Dynamically Varying Exponent and Fraction Sizes
Micaela Serôdio, João Lopes, José Sousa, Horácio Neto and Mário Véstias (University of Lisbon, Portugal)

12:45 - 13:00 Close

13:00 - 14:15 Lunch

16:00 - 19:00 Sightseeing tour of Málaga (Cathedral and Alcazaba)

20:00 - 21:30 Catamaran Experience


ARITH 2024 ACCEPTED PAPERS

ARITH 2024 Accepted Papers with Abstracts