subscribe to arXiv mailings

The Rise and Fall of the Initial Era

Abstract: Bibliographic data is a rich source of information that goes beyond the use cases of location and citation -- it also encodes both cultural and technological context. For most of its existence, the scholarly record has changed slowly and hence provides an opportunity to gain insight through its reflection of the cultural norms of the research community over the last four centuries. While it is oft… ▽ More Bibliographic data is a rich source of information that goes beyond the use cases of location and citation -- it also encodes both cultural and technological context. For most of its existence, the scholarly record has changed slowly and hence provides an opportunity to gain insight through its reflection of the cultural norms of the research community over the last four centuries. While it is often difficult to distinguish the originating driver of change, it is still valuable to consider the motivating influences that have led to changes in the structure of the scholarly record. An "initial era" is identified during which initials were used in preference to full names by authors on scholarly communications. Causes of the emergence and demise of this era are considered as well as the implications of this era on research culture and practice. △ Less

Submitted 8 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

Comments: 20 pages, 18 figures, updated references, some expanded commentary on gender analysis

arXiv:2401.04022 [pdf, other]

Identifying Fabricated Networks within Authorship-for-Sale Enterprises

Authors: Simon J. Porter, Leslie D. McIntosh

Abstract: Fabricated papers do not just need text, images, and data, they also require a fabricated or partially fabricated network of authors. Most `authors' on a fabricated paper have not been associated with the research, but rather are added through a transaction. This lack of deeper connection means that there is a low likelihood that co-authors on fabricated papers will ever appear together on the sam… ▽ More Fabricated papers do not just need text, images, and data, they also require a fabricated or partially fabricated network of authors. Most `authors' on a fabricated paper have not been associated with the research, but rather are added through a transaction. This lack of deeper connection means that there is a low likelihood that co-authors on fabricated papers will ever appear together on the same paper more than once. This paper constructs a model that encodes some of the key characteristics of this activity in an `authorship-for-sale' network with the aim to create a robust method to detect this type of activity. A characteristic network fingerprint arises from this model that provides a robust statistical approach to the detection of paper-mill networks. The model suggested in this paper detects networks that have a statistically significant overlap with other approaches that principally rely on textual analysis for the detection of fraudulent papers. Researchers connected to networks identified using the methodology outlined in this paper are shown to be connected with 37% of papers identified through the tortured-phrase and clay-feet methods deployed in the Problematic Paper Screener website. Finally, methods to limit the expansion and propagation of these networks is discussed both in technological and social terms. △ Less

Submitted 8 January, 2024; originally announced January 2024.

arXiv:2211.06579 [pdf]

Explainable Artificial Intelligence: Precepts, Methods, and Opportunities for Research in Construction

Authors: Peter ED Love, Weili Fang, Jane Matthews, Stuart Porter, Hanbin Luo, Lieyun Ding

Abstract: Explainable artificial intelligence has received limited attention in construction despite its growing importance in various other industrial sectors. In this paper, we provide a narrative review of XAI to raise awareness about its potential in construction. Our review develops a taxonomy of the XAI literature comprising its precepts and approaches. Opportunities for future XAI research focusing o… ▽ More Explainable artificial intelligence has received limited attention in construction despite its growing importance in various other industrial sectors. In this paper, we provide a narrative review of XAI to raise awareness about its potential in construction. Our review develops a taxonomy of the XAI literature comprising its precepts and approaches. Opportunities for future XAI research focusing on stakeholder desiderata and data and information fusion are identified and discussed. We hope the opportunities we suggest stimulate new lines of inquiry to help alleviate the scepticism and hesitancy toward AI adoption and integration in construction. △ Less

Submitted 10 February, 2023; v1 submitted 12 November, 2022; originally announced November 2022.

Comments: 56 pages, 3 figures. arXiv admin note: text overlap with arXiv:1910.10045 by other authors

ACM Class: H.0; H.4; J.0

arXiv:2211.06561 [pdf]

Explainable Artificial Intelligence in Construction: The Content, Context, Process, Outcome Evaluation Framework

Authors: Peter ED Love, Jane Matthews, Weili Fang, Stuart Porter, Hanbin Luo, Lieyun Ding

Abstract: Explainable artificial intelligence is an emerging and evolving concept. Its impact on construction, though yet to be realised, will be profound in the foreseeable future. Still, XAI has received limited attention in construction. As a result, no evaluation frameworks have been propagated to enable construction organisations to understand the what, why, how, and when of XAI. Our paper aims to fill… ▽ More Explainable artificial intelligence is an emerging and evolving concept. Its impact on construction, though yet to be realised, will be profound in the foreseeable future. Still, XAI has received limited attention in construction. As a result, no evaluation frameworks have been propagated to enable construction organisations to understand the what, why, how, and when of XAI. Our paper aims to fill this void by developing a content, context, process, and outcome evaluation framework that can be used to justify the adoption and effective management of XAI. After introducing and describing this novel framework, we discuss its implications for future research. While our novel framework is conceptual, it provides a frame of reference for construction organisations to make headway toward realising XAI business value and benefits. △ Less

Submitted 11 November, 2022; originally announced November 2022.

Comments: 43 pages, 5 figures

ACM Class: H.0; H.4; J.0

arXiv:2209.00104 [pdf, other]

doi 10.1162/qss_a_00244

Recategorising research: Mapping from FoR 2008 to FoR 2020 in Dimensions

Authors: Simon J Porter, Lezan Hawizy, Daniel W Hook

Abstract: In 2020 the Australia New Zealand Standard Research Classification Fields of Research Codes (ANZSRC FoR codes) were updated by their owners. This has led the sector to need to update their systems of reference and has caused suppliers working in the research information sphere to need to update both systems and data. This paper describes the approach developed by Digital Science's Dimensions team… ▽ More In 2020 the Australia New Zealand Standard Research Classification Fields of Research Codes (ANZSRC FoR codes) were updated by their owners. This has led the sector to need to update their systems of reference and has caused suppliers working in the research information sphere to need to update both systems and data. This paper describes the approach developed by Digital Science's Dimensions team to the creation of an improved machine learning training set, and the mapping of that set from FoR 2008 codes to FoR 2020 codes so that Dimensions classification approach for the ANZSRC codes could be improved and updated. △ Less

Submitted 21 January, 2023; v1 submitted 31 August, 2022; originally announced September 2022.

Comments: 10 pages, 6 figures, v2 - more information on translation of dataset to production system, author added to reflect these changes

arXiv:2112.08472 [pdf, other]

doi 10.3389/frma.2022.835139

Connecting Scientometrics: Dimensions as a route to broadening context for analyses

Authors: Simon J Porter, Daniel W Hook

Abstract: Modern cloud-based data infrastructures open new vistas for the deployment of scientometric data into the hands of practitioners. These infrastructures lower barriers to entry by making data more available and compute capacity more affordable. In addition, if data are prepared appropriately, with unique identifiers, it is possible to connect many different types of data. Bringing broader world dat… ▽ More Modern cloud-based data infrastructures open new vistas for the deployment of scientometric data into the hands of practitioners. These infrastructures lower barriers to entry by making data more available and compute capacity more affordable. In addition, if data are prepared appropriately, with unique identifiers, it is possible to connect many different types of data. Bringing broader world data into the hands of practitioners (policymakers, strategists and others) who use scientometrics as a tool can extend their capabilities. These ideas are explored through connecting Dimensions and World Bank data on Google BigQuery to study international collaboration between countries of different economic classification. △ Less

Submitted 15 December, 2021; originally announced December 2021.

Comments: 8 pages, 8 figures

arXiv:2109.13640 [pdf, other]

Measuring Research Information Citizenship Across ORCID Practice

Authors: Simon Porter

Abstract: Over the past 10 years stakeholders across the scholarly communications community have invested significantly not only to increase the adoption of ORCID adoption by researchers, but also to build the the broader infrastructures that are needed both to support ORCID and to benefit from it. These parallel efforts have fostered the emergence of "research information citizenry", which comprises, but i… ▽ More Over the past 10 years stakeholders across the scholarly communications community have invested significantly not only to increase the adoption of ORCID adoption by researchers, but also to build the the broader infrastructures that are needed both to support ORCID and to benefit from it. These parallel efforts have fostered the emergence of "research information citizenry", which comprises, but is not limited to, researchers, publishers, funders, and institutions. This paper takes a scientometric approach to investigating how effectively ORCID roles and responsibilities within this citizenry have been adopted. Focusing specifically on researchers, publishers, and funders, ORCID behaviours are measured against the approximated research world represented by the Dimensions dataset. △ Less

Submitted 29 September, 2021; v1 submitted 28 September, 2021; originally announced September 2021.

arXiv:2101.09567 [pdf, other]

doi 10.3389/frma.2021.656233

Scaling Scientometrics: Dimensions on Google BigQuery as an infrastructure for large-scale analysis

Authors: Daniel W Hook, Simon J Porter

Abstract: Cloud computing has the capacity to transform many parts of the research ecosystem, from particular research areas to overall strategic decision making and policy. Scientometrics sits at the boundary between research and the decision making and evaluation processes of research. One of the biggest challenges in research policy and strategy is having access to data that allows iterative analysis to… ▽ More Cloud computing has the capacity to transform many parts of the research ecosystem, from particular research areas to overall strategic decision making and policy. Scientometrics sits at the boundary between research and the decision making and evaluation processes of research. One of the biggest challenges in research policy and strategy is having access to data that allows iterative analysis to inform decisions. Many of these decisions are based on "global" measures such as benchmark metrics that are hard to source. In this article, Cloud technologies are explored in this context. A novel visualisation technique is presented and used as a means to explore the potential for scaling scientometrics by democratising both access to data and compute capacity using the Cloud. △ Less

Submitted 23 January, 2021; originally announced January 2021.

Comments: 12 pages, 5 figures

arXiv:1804.07511 [pdf, other]

IP Over ICN Goes Live

Authors: George Xylomenos, Yannis Thomas, Xenofon Vasilakos, Michael Georgiades, Alexander Phinikarides, Ioannis Doumanis, Stuart Porter, Dirk Trossen, Sebastian Robitzsch, Martin J. Reed, Mays Al-Naday, George Petropoulos, Konstantinos Katsaros, Maria-Evgenia Xezonaki, Janne Riihijarvi

Abstract: Information-centric networking (ICN) has long been advocating for radical changes to the IP-based Internet. However, the upgrade challenges that this entails have hindered ICN adoption. To break this loop, the POINT project proposed a hybrid, IP-over-ICN, architecture: IP networks are preserved at the edge, connected to each other over an ICN core. This exploits the key benefits of ICN, enabling i… ▽ More Information-centric networking (ICN) has long been advocating for radical changes to the IP-based Internet. However, the upgrade challenges that this entails have hindered ICN adoption. To break this loop, the POINT project proposed a hybrid, IP-over-ICN, architecture: IP networks are preserved at the edge, connected to each other over an ICN core. This exploits the key benefits of ICN, enabling individual network operators to improve the performance of their IP-based services, without changing the rest of the Internet. We provide an overview of POINT and outline how it improves upon IP in terms of performance and resilience. Our focus is on the successful trial of the POINT prototype in a production network, where real users operated actual IP-based applications. △ Less

Submitted 13 May, 2018; v1 submitted 20 April, 2018; originally announced April 2018.

Comments: EuCNC 2018. arXiv admin note: text overlap with arXiv:1804.07509

arXiv:1804.07509 [pdf, other]

IPTV Over ICN

Authors: George Xylomenos, Alexander Phinikarides, Ioannis Doumanis, Xenofon Vasilakos, Yannis Thomas, Dirk Trossen, Michael Georgiades, Stuart Porter

Abstract: The efficient provision of IPTV services requires support for IP multicasting and IGMP snooping, limiting such services to single operator networks. Information-Centric Networking (ICN), with its native support for multicast seems ideal for such services, but it requires operators and users to overhaul their networks and applications. The POINT project has proposed a hybrid, IP-over-ICN, architect… ▽ More The efficient provision of IPTV services requires support for IP multicasting and IGMP snooping, limiting such services to single operator networks. Information-Centric Networking (ICN), with its native support for multicast seems ideal for such services, but it requires operators and users to overhaul their networks and applications. The POINT project has proposed a hybrid, IP-over-ICN, architecture, preserving IP devices and applications at the edge, but interconnecting them via an SDN-based ICN core. This allows individual operators to exploit the benefits of ICN, without expecting the rest of the Internet to change. In this paper, we first outline the POINT approach and show how it can handle multicast-based IPTV services in a more efficient and resilient manner than IP. We then describe a successful trial of the POINT prototype in a production network, where real users tested actual IPTV services over both IP and POINT under regular and exceptional conditions. Results from the trial show that the POINT prototype matched or improved upon the services offered via plain IP. △ Less

Submitted 13 May, 2018; v1 submitted 20 April, 2018; originally announced April 2018.

Comments: Packet Video Workshop 2018

Showing 1–10 of 10 results for author: Porter, S