Text classification for private procurement: a survey and an analysis of future trends

Bellomi, Francesco (Member of the Collaboration Group);
Cristani, Matteo (Member of the Collaboration Group)
2024-01-01

Abstract

Techniques for text classification, categorization, clustering and segregation have a long history of applications in a variety of fields, including social network analysis, document archiving and business document processing. Private procurement, which spans many application domains, is a small but very challenging area for these techniques. After the globalization of commerce in the 1990s, the explosion of B2C e-commerce in the 2000s and the emergence of international B2B e-procurement processes in the 2010s, we are now in a post-COVID era in which internationalisation has gained momentum. We now face a multilingual and transparent market with differing levels of development, in which comparison processes are ubiquitously required. In this survey we identify major trends in the future of text classification for multilingual, multicultural and non-standardized procurement processes. The use of Large Language Models (LLMs), and in particular the development of a dedicated field of post-processing of LLM answers, the dual of the emerging field of prompt engineering, will establish a new environment for procurement. We envision an application domain built on the combined use of prompt engineering and post-processing algorithms to improve the performance of classification technologies for e-procurement. Moreover, the development of the translation abilities of LLMs, together with other approaches to machine translation, will bring new levels of quality to these applications.
2024
Text retrieval, Procurement, LLM

Use this identifier to cite or link to this document: https://hdl.handle.net/11562/1146813