concatenated profane words make "false positive" prediction #53

ciapecki · 2024-10-01T22:49:23Z

predict_prob(['fuck','shit','fuckshit'])
#[1. 0.99999982 0.03636672]

Is there a possibility to treat the last element of array as profane?

The text was updated successfully, but these errors were encountered:

dimitrismistriotis · 2024-10-02T13:45:16Z

Thanks for the issue.

We had similar discussions in the past including for when the code was "living" in Gitlab, nice to have it here for reference.

In order to do so we should update the dataset with more sentences having fuckshit annotated as profane. Currently we are using the original dataset and have not discussed updating it, although I am open to the possibility if one can appoint a good corpus.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

concatenated profane words make "false positive" prediction #53

concatenated profane words make "false positive" prediction #53

ciapecki commented Oct 1, 2024

dimitrismistriotis commented Oct 2, 2024

concatenated profane words make "false positive" prediction #53

concatenated profane words make "false positive" prediction #53

Comments

ciapecki commented Oct 1, 2024

dimitrismistriotis commented Oct 2, 2024