Counting spurious entities #15

Open
anjiefang opened this issue Sep 17, 2019 · 5 comments

anjiefang commented Sep 17, 2019

Hi,

I found an issue when counting spurious entities. In lines 317-322 of ner_eval.py, when a spurious entity is found, the spurious count is incremented for every entity type present in the example. Should it only be incremented for the type of the spurious prediction, i.e. should this:

for true in tags:
    evaluation_agg_entities_type[true]['strict']['spurious'] += 1
    evaluation_agg_entities_type[true]['ent_type']['spurious'] += 1
    evaluation_agg_entities_type[true]['partial']['spurious'] += 1
    evaluation_agg_entities_type[true]['exact']['spurious'] += 1

be changed to this

evaluation_agg_entities_type[pred.e_type]['strict']['spurious'] += 1
evaluation_agg_entities_type[pred.e_type]['ent_type']['spurious'] += 1
evaluation_agg_entities_type[pred.e_type]['partial']['spurious'] += 1 
evaluation_agg_entities_type[pred.e_type]['exact']['spurious'] += 1

?
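
To make the difference concrete, here is a small standalone sketch (the per-type counters are simplified stand-ins, not the actual ner_eval.py structures) showing how the two strategies diverge for a single spurious PER prediction in an example whose gold tags are PER and LOC:

from collections import defaultdict

# Simplified per-type counters; only the 'spurious' field is modelled.
current = defaultdict(lambda: {'spurious': 0})   # current behaviour
proposed = defaultdict(lambda: {'spurious': 0})  # proposed behaviour

tags = ['PER', 'LOC']   # gold entity types found in this example
pred_e_type = 'PER'     # type of the spurious prediction

# Current code: every gold tag is charged for the spurious prediction.
for true in tags:
    current[true]['spurious'] += 1

# Proposed change: only the predicted type is charged.
proposed[pred_e_type]['spurious'] += 1

print(dict(current))    # {'PER': {'spurious': 1}, 'LOC': {'spurious': 1}}
print(dict(proposed))   # {'PER': {'spurious': 1}}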

Thanks.
Andy.

ivyleavedtoadflax commented Sep 20, 2019

Hi Andy, thanks very much for taking the time to create an issue. Looking at the code, it seems I was unsure about this too, as I left this comment:

                # or when it simply does not appear in the test set, then it is
                # spurious, but it is not clear where to assign it at the tag
                # level. In this case, it is applied to all target_tags
                # found in this example. This will mean that the sum of the
                # evaluation_agg_entities will not equal evaluation.

What do you think about it @davidsbatista?

@ivyleavedtoadflax

Also @anjiefang, you may be interested to see that we have started to convert this code into a module here: https://github.com/ivyleavedtoadflax/nervaluate, although we haven't got much further with it yet. I have a task coming up for which I will need to use it, so I hope to get more time to develop it in the near future.

@amlarraz

I'm working with the library and I've found what I think is a mistake in this part of the code.
When the predicted entity is not in the list of true entities, its offsets do not exactly match any of the true entities, and it has no overlap with any of them, the code adds 1 to the 'spurious' field of every label. This follows from the note:

NOTE: when pred.e_type is not found in tags
or when it simply does not appear in the test set, then it is
spurious, but it is not clear where to assign it at the tag
level. In this case, it is applied to all target_tags
found in this example. This will mean that the sum of the
evaluation_agg_entities will not equal evaluation

but there is no check to ensure that the predicted label is not in the label set.

Maybe it is necessary to add if pred.e_type not in tags: before the for true in tags: loop here?
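
A rough sketch of what that guard could look like around the existing block (this is only an illustration of the suggestion, not the exact code in ner_eval.py):

# Fall back to charging every gold tag only when the predicted type is
# genuinely absent from this example's gold tags; otherwise the spurious
# count can be attributed to pred.e_type directly.
if pred.e_type not in tags:
    for true in tags:
        evaluation_agg_entities_type[true]['strict']['spurious'] += 1
        evaluation_agg_entities_type[true]['ent_type']['spurious'] += 1
        evaluation_agg_entities_type[true]['partial']['spurious'] += 1
        evaluation_agg_entities_type[true]['exact']['spurious'] += 1
else:
    evaluation_agg_entities_type[pred.e_type]['strict']['spurious'] += 1
    evaluation_agg_entities_type[pred.e_type]['ent_type']['spurious'] += 1
    evaluation_agg_entities_type[pred.e_type]['partial']['spurious'] += 1
    evaluation_agg_entities_type[pred.e_type]['exact']['spurious'] += 1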

@ivyleavedtoadflax

Hi @amlarraz. Many thanks for your comment. Could you possibly open a PR for this?

@amlarraz

No problem, I've just created the pull request.
Many thanks for your work!
