Safetensors
Italian
mrovera commited on
Commit
8663be4
·
verified ·
1 Parent(s): fbf075b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +30 -11
README.md CHANGED
@@ -8,7 +8,7 @@ language:
8
  # modafact-ita
9
 
10
  `modafact-ita` is a sequence-to-sequence fine-tuned model for joint **event Factuality and Modality** detection in **Italian**.
11
- The model was fine-tuned on [ModaFact](https://huggingface.co/datasets/dhfbk/modafact-ita), a dataset manually annotated with Factuality and Modality values.
12
 
13
 
14
  ## Model Details
@@ -25,8 +25,9 @@ The model was fine-tuned on [ModaFact](https://huggingface.co/datasets/dhfbk/mod
25
 
26
  <!-- Provide the basic links for the model. -->
27
 
28
- - **Paper:** coming soon!
29
- <!-- - **Inference script:** [https://github.com/dhfbk/ModaFact](https://github.com/dhfbk/ModaFact) -->
 
30
 
31
  ## Uses
32
 
@@ -47,17 +48,35 @@ and outputs a sequence of span=labels, in this format:
47
 
48
  [https://huggingface.co/datasets/dhfbk/modafact-ita](https://huggingface.co/datasets/dhfbk/modafact-ita)
49
 
50
- #### Training Hyperparameters
51
-
52
- - **Hyperparameters:**
53
 
54
-
55
- ## Evaluation
56
 
57
 
58
  ## Citation
59
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
60
 
61
- **BibTeX:**
62
-
63
- coming soon!
 
8
  # modafact-ita
9
 
10
  `modafact-ita` is a sequence-to-sequence fine-tuned model for joint **event Factuality and Modality** detection in **Italian**.
11
+ The model was fine-tuned on [ModaFact](https://huggingface.co/datasets/dhfbk/modafact-ita), a dataset manually annotated with Factuality and Modality values, using [mT5-xxl](https://huggingface.co/google/mt5-xxl) as a base model.
12
 
13
 
14
  ## Model Details
 
25
 
26
  <!-- Provide the basic links for the model. -->
27
 
28
+ - **Inference script:** if you want to use the model for inference, please refer to our [github repo](https://github.com/dhfbk/ModaFact).
29
+ - **Paper:** [ModaFact: Multi-paradigm Evaluation for Joint Event Modality and Factuality Detection]([https://aclanthology.org/2025.coling-main.425/])
30
+
31
 
32
  ## Uses
33
 
 
48
 
49
  [https://huggingface.co/datasets/dhfbk/modafact-ita](https://huggingface.co/datasets/dhfbk/modafact-ita)
50
 
51
+ <!-- #### Training Hyperparameters -->
 
 
52
 
53
+ <!-- ## Evaluation -->
 
54
 
55
 
56
  ## Citation
57
 
58
+ If you use or refer to ModaFact, please consider citing this paper:
59
+
60
+ ```
61
+ @inproceedings{rovera-etal-2025-modafact,
62
+ title = "{M}oda{F}act: Multi-paradigm Evaluation for Joint Event Modality and Factuality Detection",
63
+ author = "Rovera, Marco and
64
+ Cristoforetti, Serena and
65
+ Tonelli, Sara",
66
+ editor = "Rambow, Owen and
67
+ Wanner, Leo and
68
+ Apidianaki, Marianna and
69
+ Al-Khalifa, Hend and
70
+ Eugenio, Barbara Di and
71
+ Schockaert, Steven",
72
+ booktitle = "Proceedings of the 31st International Conference on Computational Linguistics",
73
+ month = jan,
74
+ year = "2025",
75
+ address = "Abu Dhabi, UAE",
76
+ publisher = "Association for Computational Linguistics",
77
+ url = "https://aclanthology.org/2025.coling-main.425/",
78
+ pages = "6378--6396",
79
+ abstract = "Factuality and modality are two crucial aspects concerning events, since they convey the speaker`s commitment to a situation in discourse as well as how this event is supposed to occur in terms of norms, wishes, necessity, duty and so on. Capturing them both is necessary to truly understand an utterance meaning and the speaker`s perspective with respect to a mentioned event. Yet, NLP studies have mostly dealt with these two aspects separately, mainly devoting past efforts to the development of English datasets. In this work, we propose ModaFact, a novel resource with joint factuality and modality information for event-denoting expressions in Italian. We propose a novel annotation scheme, which however is consistent with existing ones, and compare different classification systems trained on ModaFact, as a preliminary step to the use of factuality and modality information in downstream tasks. The dataset and the best-performing model are publicly released and available under an open license."
80
+ }
81
+ ```
82