kth.sePublikationer KTH
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
DUETS: A Dataset of Reproducible Pairs of Java Library-Clients
KTH, Skolan för elektroteknik och datavetenskap (EECS), Datavetenskap, Programvaruteknik och datorsystem, SCS.ORCID-id: 0000-0002-1996-6134
KTH, Skolan för elektroteknik och datavetenskap (EECS), Datavetenskap, Programvaruteknik och datorsystem, SCS.ORCID-id: 0000-0003-0541-6411
KTH, Skolan för elektroteknik och datavetenskap (EECS), Datavetenskap, Programvaruteknik och datorsystem, SCS.ORCID-id: 0000-0002-4015-4640
2021 (Engelska)Ingår i: 2021 IEEE/Acm 18Th International Conference On Mining Software Repositories (MSR 2021), Institute of Electrical and Electronics Engineers (IEEE) , 2021, s. 545-549Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Software engineering researchers look for software artifacts to study their characteristics or to evaluate new techniques. In this paper, we introduce DUETS, a new dataset of software libraries and their clients. This dataset can be exploited to gain many different insights, such as API usage, usage inputs, or novel observations about the test suites of clients and libraries. DUETS is meant to support both static and dynamic analysis. This means that the libraries and the clients compile correctly, they are executable and their test suites pass. The dataset is composed of open-source projects that have more than five stars on GitHub. The final dataset contains 395 libraries and 2;874 clients. Additionally, we provide the raw data that we use to create this dataset, such as 34;560 pom.xml files or the complete file list from 34;560 projects. This dataset can be used to study how libraries are used by their clients or as a list of software projects that successfully build. The client's test suite can be used as an additional verification step for code transformation techniques that modify the libraries.

Ort, förlag, år, upplaga, sidor
Institute of Electrical and Electronics Engineers (IEEE) , 2021. s. 545-549
Serie
IEEE International Working Conference on Mining Software Repositories, ISSN 2160-1852
Nyckelord [en]
Mining software repositories, software reuse, Java, Maven
Nationell ämneskategori
Programvaruteknik
Identifikatorer
URN: urn:nbn:se:kth:diva-303381DOI: 10.1109/MSR52588.2021.00071ISI: 000693399500059Scopus ID: 2-s2.0-85108531256OAI: oai:DiVA.org:kth-303381DiVA, id: diva2:1602849
Konferens
29th IEEE/ACM International Conference on Program Comprehension (ICPC) / 18th IEEE/ACM International Conference on Mining Software Repositories (MSR), MAY 22-30, 2021, ELECTR NETWORK
Anmärkning

Part of proceedings: ISBN 978-1-7281-8710-5, QC 20230117

Tillgänglig från: 2021-10-13 Skapad: 2021-10-13 Senast uppdaterad: 2023-01-17Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltextScopus

Person

Durieux, ThomasSoto Valero, CésarBaudry, Benoit

Sök vidare i DiVA

Av författaren/redaktören
Durieux, ThomasSoto Valero, CésarBaudry, Benoit
Av organisationen
Programvaruteknik och datorsystem, SCS
Programvaruteknik

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetricpoäng

doi
urn-nbn
Totalt: 82 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf