Langanzeige der Metadaten
DC Element | Wert | Sprache |
---|---|---|
dc.contributor.author | Hrycyk, Lianna | - |
dc.contributor.author | Zarcone, Alessandra | - |
dc.contributor.author | Hahn, Luzian | - |
dc.date.accessioned | 2021-09-21T07:32:06Z | - |
dc.date.available | 2021-09-21T07:32:06Z | - |
dc.date.issued | 2021-09 | - |
dc.identifier.uri | https://fordatis.fraunhofer.de/handle/fordatis/213 | - |
dc.identifier.uri | http://dx.doi.org/10.24406/fordatis/140 | - |
dc.description.abstract | The inCLINC dataset (incremental intent annotations of the CLINC dataset) contains 121 distinct utterances (queries directed to a voice assistant) in their complete form and in partial form for a total of 538 utterances, which were labeled with intent categories in a crowdsourcing study by 126 coders. The tagset consisted of 37 intent categories plus one out-of-scope category. Each utterance was annotated by 6 to 9 coders. To refer to inCLINC in any publication, please cite the following paper: Hrycyk, L., Zarcone, A., & Hahn, L. (2021). Not So Fast, Classifier – Accuracy and Entropy Reduction in Incremental Intent Classification. In Proceedings of the 3rd Workshop on NLP for Conversational AI (NLP4ConvAI 2021). | en |
dc.language.iso | en | en |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en |
dc.subject | intent | en |
dc.subject | incrementality | en |
dc.subject | dialogue data | en |
dc.subject | NLU | en |
dc.subject | voice assistants | en |
dc.subject | crowdsourcing | en |
dc.subject.ddc | DDC::400 Sprache | en |
dc.subject.ddc | DDC::000 Informatik, Informationswissenschaft, allgemeine Werke | en |
dc.title | inCLINC: incremental intent annotations of the CLINC dataset | en |
dc.type | Textual Data | en |
dc.contributor.funder | Bundesministerium fur Wirtschaft und Energie BMWi (Deutschland) | en |
fordatis.institute | IIS Fraunhofer-Institut für Integrierte Schaltungen | en |
fordatis.project.fhgid | 210011 | en |
fordatis.rawdata | false | en |
fordatis.sponsorship.FundingProgramme | Innovationswettbewerb "Künstliche Intelligenz als Treiber für volkswirtschaftlich relevante Ökosysteme" | en |
fordatis.sponsorship.projectid | FKZ 01MK20011A | en |
fordatis.sponsorship.projectname | SPEAKER - Aufbau einer führenden Sprachassistenzplattform ”Made in Germany” | en |
fordatis.sponsorship.projectacronym | SPEAKER | en |
fordatis.date.start | 2020-10 | - |
fordatis.date.end | 2020-12 | - |
Enthalten in den Sammlungen: | Fraunhofer-Institut für Integrierte Schaltungen IIS |
Dateien zu dieser Ressource:
Datei | Beschreibung | Größe | Format | |
---|---|---|---|---|
data_entropy_reduction_majority.csv | the main dataset | 78,71 kB | CSV | Öffnen/Download |
user_responses.csv | all labels assigned to each stimulus | 85,14 kB | CSV | Öffnen/Download |
README.md | 2,37 kB | Unknown | Öffnen/Download |
Diese Ressource wurde unter folgender Copyright-Bestimmung veröffentlicht: Lizenz von Creative Commons