Network Working Group

Internet Architecture Board (IAB)                           N. ten Oever
Internet-Draft
Request for Comments: 9307                       University of Amsterdam
Intended status:
Category: Informational                                          C. Cath
Expires: 1 December 2022
ISSN: 2070-1721                                  University of Cambridge
                                                            M. Kühlewind
                                                                Ericsson
                                                           C. S. Perkins
                                                   University of Glasgow
                                                             30 May
                                                             August 2022

     Report from the IAB Workshop on Analyzing IETF Data (AID), (AID) 2021
                       draft-iab-aid-workshop-01

Abstract

   The 'Show "Show me the numbers: Workshop on Analyzing IETF Data (AID)' (AID)"
   workshop was convened by the Internet Architecture Board (IAB) from
   November 29 to December 2 2, 2021 and hosted by the IN-SIGHT.it project
   at the University of
   Amsterdam, Amsterdam; however, it was converted to an online only
   online-only event.  The workshop was conducted based on organized into two discussion
   parts and with a hackathon activity in between.  This report summarizes
   the workshop's discussion and identifies topics that warrant future
   work and consideration.

   Note that this document is a report on the proceedings of the
   workshop.  The views and positions documented in this report are
   those of the workshop participants and do not necessarily reflect IAB
   views and positions.

Discussion Venues

   This note is to be removed before publishing as an RFC.

   Source for this draft and an issue tracker can be found at
   https://github.com/intarchboard/workshop-aid.

Status of This Memo

   This Internet-Draft document is submitted in full conformance with the
   provisions of BCP 78 and BCP 79.

   Internet-Drafts are working documents not an Internet Standards Track specification; it is
   published for informational purposes.

   This document is a product of the Internet Engineering
   Task Force (IETF).  Note Architecture Board (IAB)
   and represents information that other groups may also distribute
   working documents as Internet-Drafts.  The list the IAB has deemed valuable to
   provide for permanent record.  It represents the consensus of current Internet-
   Drafts is at https://datatracker.ietf.org/drafts/current/.

   Internet-Drafts the
   Internet Architecture Board (IAB).  Documents approved for
   publication by the IAB are draft documents valid not candidates for a maximum any level of six months Internet
   Standard; see Section 2 of RFC 7841.

   Information about the current status of this document, any errata,
   and how to provide feedback on it may be updated, replaced, or obsoleted by other documents obtained at any
   time.  It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as "work in progress."

   This Internet-Draft will expire on 1 December 2022.
   https://www.rfc-editor.org/info/rfc9307.

Copyright Notice

   Copyright (c) 2022 IETF Trust and the persons identified as the
   document authors.  All rights reserved.

   This document is subject to BCP 78 and the IETF Trust's Legal
   Provisions Relating to IETF Documents (https://trustee.ietf.org/
   license-info)
   (https://trustee.ietf.org/license-info) in effect on the date of
   publication of this document.  Please review these documents
   carefully, as they describe your rights and restrictions with respect
   to this document.  Code Components
   extracted from this document must include Revised BSD License text as
   described in Section 4.e of the Trust Legal Provisions and are
   provided without warranty as described in the Revised BSD License.

Table of Contents

   1.  Introduction  . . . . . . . . . . . . . . . . . . . . . . . .   3
   2.  Workshop Scope and Discussion . . . . . . . . . . . . . . . .   3
     2.1.  Tools, data, Data, and methods  . . . . . . . . . . . . . . . .   3 Methods
     2.2.  Observations on affiliation Affiliation and industry control  . . . .   4 Industry Control
     2.3.  Community and diversity . . . . . . . . . . . . . . . . .   5 Diversity
     2.4.  Publications, process, Process, and decision-making  . . . . . . .   6 Decision Making
     2.5.  Environmental Sustainability  . . . . . . . . . . . . . .   7
   3.  Hackathon Report  . . . . . . . . . . . . . . . . . . . . . .   7
   4.  Position Papers . . . . . . . . . . . . . . . . . . . . . . .   8
     4.1.  Tools, data, Data, and methods  . . . . . . . . . . . . . . . .   8 Methods
     4.2.  Observations on affiliation Affiliation and industry control  . . . .   8 Industry Control
     4.3.  Community and diversity . . . . . . . . . . . . . . . . .   9 Diversity
     4.4.  Publications, process, Process, and decision-making  . . . . . . .   9 Decision Making
     4.5.  Environmental Sustainability  . . . . . . . . . . . . . .  10
   5.  Workshop participants . . . . . . . . . . . . . . . . . . . .  10
   6.  Informative References
   Appendix A.  Data Taxonomy
   Appendix B.  Program Committee . . . . . . . . . . . . . . . . . . . . . .  10
   7.
   Appendix C.  Workshop Participants
   IAB Members at the Time of Approval
   Acknowledgments . . . . . . . . . . . . . . . . . . . . . . .  10
   8.  Annexes . . . . . . . . . . . . . . . . . . . . . . . . . . .  11
     8.1.  Annex 1 - Data Taxonomy . . . . . . . . . . . . . . . . .  11
   Authors' Addresses  . . . . . . . . . . . . . . . . . . . . . . .  13

1.  Introduction

   The IETF, as an international Standards Developing Organization
   (SDO), hosts a diverse set of data including on about the organization's
   history, development, IETF's history and
   development, current standardization standardisation activities,
   including of Internet protocols protocols,
   and its institutions. the institutions that comprise the IETF.  A large portion of this
   data is publicly available, yet it is underutilized as a tool to
   inform the work in the IETF proper or the broader research community that is
   focused on topics like Internet governance and trends in ICT standard-setting. information
   and communication technologies (ICT) standard setting.

   The aim of the IAB "IAB Workshop on Analyzing IETF Data (AID) 2021 2021"
   workshop was to study how IETF data is currently used, to understand
   what insights can be drawn from that data, and to explore open
   questions around how that data may be further used in the future.

   These questions can inform a research agenda drawing from IETF data, data
   that fosters further collaborative work among interested parties,
   ranging from academia and civil society to industry and IETF
   leadership.

2.  Workshop Scope and Discussion

   The workshop was organized with two all-group discussion slots at the
   beginning and the end of the workshop.  In between between, the workshop
   participants organized hacakthon activities, hackathon activities based on topics
   identifed
   identified during the initial discussion and in submitted position
   papers.  The follow following topic areas have been were identified and discussed.

2.1.  Tools, data, Data, and methods Methods

   The IETF holds a wide range of data sources.  The main ones used are
   the mailinglist archives (https://mailarchive.ietf.org/arch/), [Mail-Arch], RFCs
   (https://www.ietf.org/standards/rfcs/), [IETF-RFCs], and the
   datatracker
   (https://datatracker.ietf.org/). [Datatracker].  The latter provides information on
   participants, authors, meeting proceedings, minutes minutes, and more
   (https://notes.ietf.org/iab-aid-datatracker-database-overview#).
   Furthermore
   [Data-Overview].  Furthermore, there are statistics for the IETF
   websites
   (https://www.ietf.org/policies/web-analytics/), [IETF-Statistics], the working group Github repositories,
   and the IETF survey data (https://www.ietf.org/blog/ietf-
   community-survey-2021/) and there [Survey-Data].  There was discussion about
   the utility of download statistics for the RFCs itself themselves from
   different repos.

   There are is a wide range of tools to analyze this data, data produced by IETF
   participants or researchers interestested interested in the work of the IETF.  Two
   projects that presented their work at the workshop were BigBang (https://bigbang-py.readthedocs.io/en/latest/)
   [BigBang] and Sodestream's IETFdata (https://github.com/glasgow-ipl/ietfdata)
   library; the [ietfdata] library.  The RFC
   Prolog Database was described in a submitted paper (see Section Section 4
   below).  These projects could be used to add provide additional insights to the insight into
   existing IETF statistics
   (https://www.arkko.com/tools/docstats.html) page [ArkkoStats] and the datatracker statistics (https://datatracker.ietf.org/stats/),
   [DatatrackerStats], e.g., related to
   gender questions, however, privacy gender-related information.  Privacy issues andd implication
   and the implications of making such data publicly available were
   discussed as well.

   The datatracker itself is a community tool that welcomes
   contributions, e.g.
   contributions; for example, for additions to the existing interfaces
   or the statistics page directly (see https://notes.ietf.org/iab-aid-
   datatracker-database-overview (https://notes.ietf.org/iab-aid-
   datatracker-database-overview)).  Instructions directly, see the Datatracker Database
   Overview [Data-Overview].  At the time of the workshop, instructions
   about how to set up a local development environment can could be found, at the time of the workshop, found at
   https://notes.ietf.org/iab-aid-data-resources
   (https://notes.ietf.org/iab-aid-data-resources).
   IAB AID Workshop Data Resources [DataResources].  Questions or any
   discussion about the datatracker and possible enhancements can be issued
   sent to tools-discuss@ietf.org.

2.2.  Observations on affiliation Affiliation and industry control Industry Control

   A large portion of the submitted position papers indicated interest
   in researching questions about industry control in the
   standardization process (vs. (as opposed to individual contributions in a
   personal capacity), where industry control covers both, both a) technical contribution
   contributions and the ability to successfully standardize these contribution as
   well as
   contributions and b) competition on leadership roles.  To assess
   these question it
   has ben discussed to investigate participant's affiliations questions, investigating participant affiliations, including
   "indirect" affiliation e.g. affiliations (e.g., by tracking funding and changes in affiliation as
   well as the nessecarity
   affiliation) was discussed.  The need to model company
   characteristics or stakeholder groups.

   Discussions groups was also discussed.

   Discussion about the analysis of IETF data shows that affiliation
   dynamics are hard to capture, capture due to the specifics of how the data is
   entered but also and because of larger social dynamics.  On the side of IETF
   data capture, affiliation is an open text field, which field that causes people to
   write their affiliation down in different ways
   (capitilization, (e.g., capitalization,
   space, word seperation, separation, etc).  A common data format could contribute
   to analyses that compare SDO performance and behavior of actors
   inside and across standards bodies.  To help this with this, a draft data
   model has been was developed during the hackathon portion of the
   workshop which workshop; the
   data model can be found as Annex 1 - Data Taxonomy. in Appendix A.

   Furthermore, there is the issue of mergers and acquisitions mergers, acquisitions, and
   subsidiary companies.  There is no authorotative authoritative exogenous source of
   variation for affiliation changes, so hand-collected and curated data
   is used to analyze changes in affiliation over time.  While this
   approach is imperfect, conclusions can be drawn from the data.  For
   example, in the case of mergers or acquisition where a small
   organizations
   organization joins a large organization, this results in a
   statistically significant increase in liklihood likelihood of an individual
   being put in a working group chair position (see BaronKanevskaia
   (https://www.iab.org/wp-content/IAB-uploads/2021/11/Baron.pdf)). the document by
   Baron and Kanevskaia [LEADERSHIP-POSITIONS]).

2.3.  Community and diversity

   High interest from the Diversity

   The workshop participants was also on were highly interested in using existing
   data to better understand who the current IETF community is,
   especially is.  They
   were also interested in terms of diversity, the community's diversity and how to
   potentially increase
   diversity it and thereby increase inclusivity, e.g. e.g.,
   understanding if are there are certain groups or lists factors that "drive people away"
   and why.  Inclusivity and transparency about the standardization
   process are generally important to keep the Internet and its
   development process viable.  As commented during the workshop
   discussion, when measuring and evaluating different angles of diversity
   diversity, it is also important to understand the actual goals that
   are intended when increasing diversity, e.g. e.g., in order to increase
   competence (mainly technical diversity from different companies and
   stakeholder groups) or relevance (also regional diversity and
   international footprint).

   The discussion on community and diversity spanned from methods that
   draw from novel text mining, time series clustering, graph mining mining,
   and psycholinguistic approaches to understand the consensus mechanism
   to more speculative approaches about what it would take to build a
   feminist Internet.  The discussion also covered the data needed to
   measure who is in the community and how diverse it is.

   The discussion highlighted that part of the challenge is defining
   what diversity means, means and how to measure it, or as one participant
   highlighted to define
   highlighted, defining "who the average IETF IETFer is".  The question  There was
   also raised a
   question about what to do about missing data or non-participating or
   underrepresented communities, like women, individuals from the
   African continent continent, and network operators.  In terms of how IETF data
   is structured, various researchers mentioned that it is hard to track
   conversations as because mail threads, threads split, merge merge, and change.  The ICANN-
   at-large
   ICANN-at-large model came up as an example of how to involve relevant
   stakeholders in the IETF that are currently not present.  Vice versa,  Conversely,
   it is also interesting for outside communities (especially policy
   makers) to get a sense of who the IETF community is and keep them
   updated.

   The human element of the community and diversity was stressed, in highlighted.  In
   order to understand the IETF community's diversity diversity, it is important
   to talk to people (beyond text analysis) and in analysis).  In order to ensure
   inclusivity
   inclusivity, individual participants must make an effort to, as one
   participant recounted, tell them their participation is valuable.

2.4.  Publications, process, Process, and decision-making Decision Making

   A number of submissions focussed focused on the RFC publication process, on
   the development of standards and other RFCs in the IETF, and on how
   the IETF makes decisions.  This included work on both technical decisions
   about the content of the standards, but also on procedural and process
   decisions, and on questions around how we can understand, model, and
   perhaps improve the standards process.  Some of the work considered
   what makes a successful RFC, an RFC successful, how are RFCs are used and referenced, and
   what we can learn about the importance of a topic by studying the
   RFCs, drafts, Internet-Drafts, and email discussion. discussions.

   There were three sets of questions to consider in this area.  The
   first question related to the success and failure of standards, standards and considered
   what
   considered:

   *  What makes a successful/good successful and good RFC?

   *  What makes the process of RFC making RFCs successful?  And how

   *  How are RFCs used and referenced once published?

   Discussion considered how to better understand the path from an internet draft
   Internet-Draft to an RFC, to see if there are specific factors that
   lead to successful development of a draft an Internet-Draft into an RFC.
   Participants explored the extent to which this depends on the
   seniority and experience of the authors, on the topic and IETF area,
   on the extent and scope of mailing list discussion, and other
   factors, to understand whether success of a draft an Internet-Draft can be predicted,
   predicted and whether interventions can be developed to increase the
   likelihood of success for work.

   The second question was around focused on decision making.

   *  How does the IETF make design decisions?

   *  What are the bottlenecks in effective decision making?

   *  When is a decision made?  And what is the decision?

   Difficulties here lie in capturing decisions and the results of
   consensus calls early in the process, and understanding the factors
   that lead to effective decision making.

   Finally, there were questions around regarding what can be learn learned about
   protocols by studying IETF publications, processes, and decision
   making?
   making.  For example, are example:

   *  Are there insights to be gained around how security concerns are
      discussed and considered in the development of standards?

   *  Is it possible to verify correctness of protocols and/or and detect
      ambiguities?

   *  What can be learnt learned by extracting insights from implementations
      and activities on implementation efforts?

   Answers to these questions will come from analysis of IETF emails,
   RFCs and Internet-Drafts, meeting minutes, recordings, Github data,
   and external data such as surveys, etc.

2.5.  Environmental Sustainability

   The final discussion session considered environmental sustainability.
   It discussed
   Topics included what is the IETF's role with respect to climate change change,
   both in terms on of what is the environmental impact of the way the IETF
   develops standards, standards and in terms of what is the environmental impact
   of the standards the IETF develops? develops.

   Discussion started by considering how sustainable are IETF meetings,
   focussing meetings are,
   focusing on how much CO2 the amount of carbon dioxide (CO2) emissions are IETF
   meetings are responsible for and how can we make the IETF more
   sustainable.  Analysis looked at the home locations of participants,
   meeting locations, and carbon footprint of air travel and remote attendance,
   attendance to estimate the carbon CO2 costs of an IETF meeting.  Initial  While the
   analysis is ongoing, initial results suggest that the costs of
   holding multiple in-person IETF meetings per year are likely
   unsustainable in terms of carbon emission, although the analysis is
   ongoing.

   Discussion also considered to what CO2 emission.

   The extent are to which climate impacts are considered in during the
   development and standardization of Internet
   protocols?  It reviewed the text of protocols was discussed.
   RFCs and Internet-Drafts of active working group
   drafts, looking groups were reviewed for
   relevant keywords to highlight the extent to which climate change,
   energy efficiency, and related topics are were considered in the design
   of Internet protocols, to show protocols.  This review revealed the limited extent to
   which these topics have been considered.  Ongoing work  There is
   considering meeting minutes and mail archives, ongoing work to
   get a fuller
   picture, picture by reviewing meeting minutes and mail archives
   as well, but initial results show only limited consideration of these
   important issues.

3.  Hackathon Report

   The middle two days of the workshop were organized as a hackathon.
   The aims of the hackathon were to 1) acquaint people with the
   different data sources and analysis methods, 2) seek to answer some
   of the questions that came up during presentations on the first day
   of the workshop, and 3) foster collaboration among researchers to
   grow a community of IETF data researchers.

   At the end of Day 1, the plenary presentation day, people were
   invited to divide themselves in into groups who selected and select their own
   respective facilitators.  All groups had their own work space and
   could use their own communication methods and channels, or use IETF's
   gather.town. channels.  Furthermore,
   daily check-ins were organized during the two hackathon days.  At  On the
   final day day, the hackathon groups presented their work in a plenary
   session.

   The objectives of the hackathon, according

   According to the co-chairs, the objectives of the hackathon have been
   met, and the output significantly exceeded expectations.  It allowed for
   more interaction then than academic conferences and produced some actual
   research results by people who had not collaborated before the
   workshop.

   Future workshops that choose to integrate a hackathon could consider
   to ask
   asking participants to submit groups, issues, issues and questions beforehand
   (potentially as part of the positions paper position papers or the sign-up process)
   to facilitate the formation of groups.

4.  Position Papers

4.1.  Tools, data, Data, and methods Methods

   Sebastian Benthall Using Benthall, "Using Complex Systems Analysis to Identify
   Organizational Interventions (https://www.iab.org/wp-content/IAB-
   uploads/2021/11/Benthall.pdf) Interventions" [COMPLEX-SYSTEMS]

   Stephen McQuistin, McQuistin and Colin Perkins The Perkins, "The ietfdata Library
   (https://www.iab.org/wp-content/IAB-uploads/2021/11/McQuistin.pdf) Library"
   [ietfdata-Library]

   Marc Petit-Huguenin The Petit-Huguenin, "The RFC Prolog Database (https://www.iab.org/wp-
   content/IAB-uploads/2021/11/Petit-Huguenin.txt) Database" [PROLOG-DATABASE]

   Jari Arkko Observations Arkko, "Observations about IETF process measurements
   (https://www.iab.org/wp-content/IAB-uploads/2021/11/Arkko.pdf) measurements"
   [MEASURING-IETF-PROCESSES]

4.2.  Observations on affiliation Affiliation and industry control Industry Control

   Justus Baron, Baron and Olia Kanevskaia Competition Kanevskaia, "Competition for Leadership
   Positions in Standards Development Organizations (https://www.iab.org/wp-content/
   IAB-uploads/2021/11/Baron.pdf) Organizations"
   [LEADERSHIP-POSITIONS]

   Nick Doty Analyzing Doty, "Analyzing IETF Data: Changing affiliations
   (https://www.iab.org/wp-content/IAB-uploads/2021/11/Doty.pdf) affiliations"
   [ANALYZING-AFFILIATIONS]

   Don Le Le, "Analysing IETF Data Position Paper (https://www.iab.org/wp-content/IAB-
   uploads/2021/11/Le.pdf) Paper" [ANALYSING-IETF]

   Elizaveta Yachmeneva Research Proposal (https://www.iab.org/wp-
   content/IAB-uploads/2021/11/Yachmeneva.pdf) Yachmeneva, "Research Proposal" [RESEARCH-PROPOSAL]

4.3.  Community and diversity Diversity

   Priyanka Sinha, Michael Ackermann, Pabitra Mitra, Arvind Singh, and
   Amit Kumar Agrawal Characterizing Agrawal, "Characterizing the IETF through its consensus
   mechanisms (https://www.iab.org/wp-content/IAB-uploads/2021/11/
   Sinha.pdf)
   mechanisms" [CONSENSUS-MECHANISMS]

   Mallory Knodel Would Knodel, "Would feminists have built a better internet?
   (https://www.iab.org/wp-content/IAB-uploads/2021/11/Knodel.pdf) internet?"
   [FEMINIST-INTERNET]

   Wes Hardaker, Hardaker and Genevieve Bartlett Identifying Bartlett, "Identifying temporal trends in
   IETF
   participation (https://www.iab.org/wp-content/IAB-uploads/2021/11/
   Hardaker.pdf) participation" [TEMPORAL-TRENDS]

   Lars Eggert Who Eggert, "Who is the Average IETF Participant?
   (https://www.iab.org/wp-content/IAB-uploads/2021/11/Eggert.pdf) Participant?"
   [AVERAGE-PARTICIPANT]

   Emanuele Tarantino, Justus Baron, Bernhard Ganglmair, Nicola Persico,
   and Timothy Simcoe Representation Simcoe, "Representation is Not Sufficient for Selecting
   Gender
   Diversity (https://www.iab.org/wp-content/IAB-uploads/2021/11/
   Tarantino.pdf) Diversity" [GENDER-DIVERSITY]

4.4.  Publications, process, Process, and decision-making Decision Making

   Michael Welzl, Carsten Griwodz, and Safiqul Islam Understanding Islam, "Understanding
   Internet Protocol Design Decisions (https://www.iab.org/wp-content/IAB-
   uploads/2021/11/Welzl.pdf) Decisions" [DESIGN-DECISIONS]

   Ignacio Castro et al Characterising al., "Characterising the IETF through the lens of
   RFC
   deployment (https://dl.acm.org/doi/abs/10.1145/3487552.3487821) deployment" [RFC-DEPLOYMENT]

   Carsten Griwodz, Safiqul Islam, and Michael Welzl The Welzl, "The Impact of
   Continuity (https://www.iab.org/wp-content/IAB-uploads/2021/11/
   Griwodz.pdf)
   Continuity" [CONTINUITY]

   Paul Hoffman RFCs Change (https://www.iab.org/wp-content/IAB-
   uploads/2021/11/Hoffman.pdf) Hoffman, "RFCs Change" [RFCs-CHANGE]

   Xue Li, Sara Magliacane, and Paul Groth The Groth, "The Challenges of
   Cross-Document Coreference Resolution in Email (https://www.iab.org/wp-content/IAB-
   uploads/2021/11/Groth.pdf) Email"
   [CROSS-DOC-COREFERENCE]

   Amelia Andersdotter Project Andersdotter, "Project in time series analysis: e-mailing lists
   (https://www.iab.org/wp-content/IAB-uploads/2021/11/Andersdotter.pdf)
   lists" [E-MAILING-LISTS]

   Mark McFadden McFadden, "A Position Paper (https://www.iab.org/wp-content/IAB-
   uploads/2021/11/McFadden.pdf) by Mark McFadden" [POSITION-PAPER]

4.5.  Environmental Sustainability

   Christoph Becker Towards Becker, "Towards Environmental Sustainability with the IETF
   (https://www.iab.org/wp-content/IAB-uploads/2021/11/Becker.pdf)
   IETF" [ENVIRONMENTAL]

   Daniel Migault CO2eq: Migault, "CO2eq: Estimating Meetings' Air Flight CO2
   Equivalent Emissions: An Illustrative Example with IETF meetings
   (https://www.iab.org/wp-content/IAB-uploads/2021/11/Migault.pdf) meetings"
   [CO2eq]

5.  Workshop participants

   Bernhard Ganglmair, Carsten Griwodz, Christoph Becker, Colin Perkins,
   Corinne Cath, Daniel Migault, Don Le, Effy Xue Li, Elizaveta
   Yachmeneva, Francois Ortolan, Greg Wood, Ignacio Castro, Jari Arkko,
   Justus Baron, Karen O'Donoghue, Lars Eggert, Mallory Knodel, Marc
   Petit-Huguenin, Mark McFadden, Michael Welzl, Mirja Kuehlewind, Nick  Informative References

   [ANALYSING-IETF]
              Article 19, "Analysing IETF Position Paper",
              <https://www.iab.org/wp-content/IAB-uploads/2021/11/
              Le.pdf>.

   [ANALYZING-AFFILIATIONS]
              Doty, Niels ten Oever, Priyanka N., "Analyzing IETF Data: Changing affiliations",
              September 2021, <https://www.iab.org/wp-content/IAB-
              uploads/2021/11/Doty.pdf>.

   [ArkkoStats]
              "Document Statistics",
              <https://www.arkko.com/tools/docstats.html>.

   [AVERAGE-PARTICIPANT]
              Eggert, L., "Who is the Average IETF Participant?",
              November 2021, <https://www.iab.org/wp-content/IAB-
              uploads/2021/11/Eggert.pdf>.

   [BigBang]  BigBang, "Welcome to BigBang’s documentation!",
              <https://bigbang-py.readthedocs.io/en/latest/>.

   [CO2eq]    Migault, D., "CO2eq: Estimating Meetings' Air Flight CO2
              Equivalent Emissions: An Illustrative Example with IETF
              meeting", <https://www.iab.org/wp-content/IAB-
              uploads/2021/11/Migault.pdf>.

   [COMPLEX-SYSTEMS]
              Benthall, S., "Using Complex Systems Analysis to Identify
              Organizational Interventions", 2021, <https://www.iab.org/
              wp-content/IAB-uploads/2021/11/Benthall.pdf>.

   [CONSENSUS-MECHANISMS]
              Sinha, Safiqul P., Ackermann, M., Mitra, P., Singh, A., and A.
              Kumar Agrawal, "Characterizing the IETF through its
              consensus mechanisms", <https://www.iab.org/wp-content/
              IAB-uploads/2021/11/Sinha.pdf>.

   [CONTINUITY]
              Griwodz, C., Islam, Sebastian
   Benthall, Stephen McQuistin, Wes Hardaker, S., and Zhenbin Li.

6.  Program Committee

   The workshop Program Committee members were Niels ten Oever (chair,
   University M. Welzl, "The Impact of Amsterdam), Colin Perkins (chair, IRTF, University
              Continuity", <https://www.iab.org/wp-content/IAB-
              uploads/2021/11/Griwodz.pdf>.

   [CROSS-DOC-COREFERENCE]
              Li, X., Magliacane, S., and P. Groth, "The Challenges of
   Glasgow), Corinne Cath (chair, Oxford
              Cross-Document Coreference Resolution in Email",
              <https://www.iab.org/wp-content/IAB-uploads/2021/11/
              Groth.pdf>.

   [Data-Overview]
              "Datatracker Database Overview", for the IAB AID Workshop,
              <https://notes.ietf.org/iab-aid-datatracker-database-
              overview#>.

   [DataResources]
              "IAB AID Workshop Data Resources",
              <https://notes.ietf.org/iab-aid-data-resources#>.

   [Datatracker]
              IETF, "Datatracker", <https://datatracker.ietf.org/>.

   [DatatrackerStats]
              IETF, "Statistics", <https://datatracker.ietf.org/stats/>.

   [DESIGN-DECISIONS]
              Welzl, M., Griwodz, C., and S. Islam, "Understanding
              Internet Institute), Mirja
   Kuehlewind (IAB, Ericsson), Zhenbin Li (IAB, Huawei), Protocol Design Decisions", <https://www.iab.org/
              wp-content/IAB-uploads/2021/11/Welzl.pdf>.

   [E-MAILING-LISTS]
              Andersdotter, A., "Project in time series analysis:
              e-mailing lists", May 2018, <https://www.iab.org/wp-
              content/IAB-uploads/2021/11/Andersdotter.pdf>.

   [ENVIRONMENTAL]
              Becker, C., "Towards Environmental Sustainability with the
              IETF", <https://www.iab.org/wp-content/IAB-
              uploads/2021/11/Becker.pdf>.

   [FEMINIST-INTERNET]
              Knodel, M., "Would feminists have built a better
              internet?", September 2021, <https://www.iab.org/wp-
              content/IAB-uploads/2021/11/Knodel.pdf>.

   [GENDER-DIVERSITY]
              Baron, J., Ganglmair, B., Persico, N., Simcoe, T., and Wes
   Hardaker (IAB, USC/ISI).

7.  Acknowledgments

   The Program Committee wishes to extend its thanks to Cindy Morgan E.
              Tarantino, "Representation is Not Sufficient for
   logistics support Selecting
              Gender Diversity", August 2021, <https://www.iab.org/wp-
              content/IAB-uploads/2021/11/Tarantino.pdf>.

   [IETF-RFCs]
              IETF, "RFCs", <https://www.ietf.org/standards/rfcs/>.

   [IETF-Statistics]
              IETF, "Web analytics",
              <https://www.ietf.org/policies/web-analytics/>.

   [ietfdata] "IETF Data", Internet Protocols Laboratory, commit
              c53bf15, August 2022,
              <https://github.com/glasgow-ipl/ietfdata>.

   [ietfdata-Library]
              McQuistin, S. and to Kate Pundyk C. Perkins, "The ietfdata Library",
              <https://www.iab.org/wp-content/IAB-uploads/2021/11/
              McQuistin.pdf>.

   [LEADERSHIP-POSITIONS]
              Baron, J. and O. Kanevskaia, "Competition for notetaking.

   Efforts put Leadership
              Positions in this workshop by Niels ten Oever was made possible
   through funding from Standards Development Organizations", October
              2021, <https://www.iab.org/wp-content/IAB-uploads/2021/11/
              Baron.pdf>.

   [Mail-Arch]
              IETF, "Mail Archive",
              <https://mailarchive.ietf.org/arch/>.

   [MEASURING-IETF-PROCESSES]
              Arkko, J., "Observations about IETF process measurements",
              <https://www.iab.org/wp-content/IAB-uploads/2021/11/
              Arkko.pdf>.

   [POSITION-PAPER]
              McFadden, M., "A Position Paper", <https://www.iab.org/wp-
              content/IAB-uploads/2021/11/McFadden.pdf>.

   [PROLOG-DATABASE]
              Huguenin, P., "The RFC Prolog Database", September 2021,
              <https://www.iab.org/wp-content/IAB-uploads/2021/11/Petit-
              Huguenin.txt>.

   [RESEARCH-PROPOSAL]
              Yachmeneva, E., "Research Proposal", <https://www.iab.org/
              wp-content/IAB-uploads/2021/11/Yachmeneva.pdf>.

   [RFC-DEPLOYMENT]
              Castro, I., Healey, P., Iqbal, W., Karan, M., Khare, P.,
              McQuistin, S., Perkins, C., Purver, M., Qadir, J., and G.
              Tyson, "Characterising the Dutch Research Council (NWO) IETF through grant
   MVI.19.032 as part of the programme 'Maatschappelijk Verantwoord
   Innoveren (MVI)'.

   We would like to thank the Ford Foundation for their support that
   made participation lens of Corinne Cath, Kate Pundyk, RFC
              deployment", November 2021,
              <https://dl.acm.org/doi/abs/10.1145/3487552.3487821>.

   [RFCs-CHANGE]
              Hoffman, P., "RFCs Change", September 2021,
              <https://www.iab.org/wp-content/IAB-uploads/2021/11/
              Hoffman.pdf>.

   [Survey-Data]
              IETF, "IETF Community Survey 2021", 11 August 2021,
              <https://www.ietf.org/blog/ietf-community-survey-2021/>.

   [TEMPORAL-TRENDS]
              Hardaker, W. and Mallory Knodel
   possible (grant number, 136179, 2020).

   Efforts in the organization of this workshop by Niels ten Oever were
   supported G. Bartlett, "Identifying temporal trends
              in part by the UK Engineering and Physical Sciences
   Research Council under grant EP/S036075/1.

8.  Annexes

8.1.  Annex 1 - IETF participation", September 2021,
              <https://www.iab.org/wp-content/IAB-uploads/2021/11/
              Hardaker.pdf>.

Appendix A.  Data Taxonomy

A Draft Data Taxonomy for SDO Data:

Organization:
  Organization Subsidiary
  Time
  Email domain
  Website domain
  Size
          Revenue, annual
          Number of employees
  Org - Affiliation Category (Labels) ; 1 : N
    Association
    Advertising Company
    Chipmaker
    Content Distribution Network
    Content Providers
    Consulting
    Cloud Provider
    Cybersecurity
    Financial Institution
    Hardware vendor
    Internet Registry
    Infrastructure Company
    Networking Equipment Vendor
    Network Service Provider
    Regional Standards Body
    Regulatory Body
    Research and Development Institution
    Software Provider
    Testing and Certification
    Telecommunications Provider
    Satellite Operator

Org - Stakeholder Group : 1 - 1
    Academia
    Civil Society
    Private Sector -- including industry consortia and associations;
    state-owned and government-funded businesses
    Government
    Technical Community (IETF, ICANN, ETSI, 3GPP, oneM2M, etc)
    Intergovernmental organization

SDO:
  Membership Types (SDO)
  Members (Organizations for some, individuals for others…) others...)
  Membership organization
    Regional SDO
      ARIB
      ATIS
      CCSA
      ETSI
      TSDSI
      TTA
      TTC
    Consortia

Country of Origin:
  Country Code

Number of Participants

Patents
  Organization
  Authors - 1 : N - Persons/Participants
  Time
  Region
  Patent Pool
  Standard Essential Patent
    If so, for which standard

Participant (An individual person)
  Name
  1: N - Emails
    Time start / time end

  1 : N : Affiliation
    Organization
    Position
          Time start / end

  1 : N : Affiliation - SDO
    Position
    SDO
    Time

  Email Domain (personal domain)

  (Contribution data is in other tables)

Document
  Status of Document
          Internet Draft
          Work Item
    Standard
  Author -
    Name
          Affiliation - Organization
    Person/Participant
        (Affiliation from Authors only?)

Data Source - Provenance for any data imported from an external data set

Meeting
  Time
  Place
  Agenda
  Registrations
    Name
    Email
    Affiliation

Appendix B.  Program Committee

   The workshop Program Committee members were Niels ten Oever (Chair,
   University of Amsterdam), Colin Perkins (Chair, IRTF, University of
   Glasgow), Corinne Cath (Chair, Oxford Internet Institute), Mirja
   Kuehlewind (IAB, Ericsson), Zhenbin Li (IAB, Huawei), and Wes
   Hardaker (IAB, USC/ISI).

Appendix C.  Workshop Participants

   The Workshop Participants were Bernhard Ganglmair, Carsten Griwodz,
   Christoph Becker, Colin Perkins, Corinne Cath, Daniel Migault, Don
   Le, Effy Xue Li, Elizaveta Yachmeneva, Francois Ortolan, Greg Wood,
   Ignacio Castro, Jari Arkko, Justus Baron, Karen O'Donoghue, Lars
   Eggert, Mallory Knodel, Marc Petit-Huguenin, Mark McFadden, Michael
   Welzl, Mirja Kuehlewind, Nick Doty, Niels ten Oever, Priyanka Sinha,
   Safiqul Islam, Sebastian Benthall, Stephen McQuistin, Wes Hardaker,
   and Zhenbin Li.

IAB Members at the Time of Approval

   Internet Architecture Board members at the time this document was
   approved for publication were:

      Jari Arkko
      Deborah Brungard
      Lars Eggert
      Wes Hardaker
      Cullen Jennings
      Mallory Knodel
      Mirja Kühlewind
      Zhenbin Li
      Tommy Pauly
      David Schinazi
      Russ White
      Quin Wu
      Jiankang Yao

Acknowledgments

   The Program Committee wishes to extend its thanks to Cindy Morgan for
   logistics support and to Kate Pundyk for note-taking.

   We would like to thank the Ford Foundation for their support that
   made participation of Corinne Cath, Kate Pundyk, and Mallory Knodel
   possible (grant number, 136179, 2020).

   Efforts put in this workshop by Niels ten Oever were made possible
   through funding from the Dutch Research Council (NWO) through grant
   MVI.19.032 as part of the program 'Maatschappelijk Verantwoord
   Innoveren (MVI)'.

   Efforts in the organization of this workshop by Colin Perkins were
   supported in part by the UK Engineering and Physical Sciences
   Research Council under grant EP/S036075/1.

Authors' Addresses

   Niels ten Oever
   University of Amsterdam
   Email: mail@nielstenoever.net

   Corinne Cath
   University of Cambridge
   Email: corinnecath@gmail.com

   Mirja Kühlewind
   Ericsson
   Email: mirja.kuehlewind@ericsson.com

   Colin Perkins
   University of Glasgow
   Email: csp@csperkins.org