The “em-dashes” (—) come up a lot in online translations of books like Bible and Quran.

Normal keyboard “-” and “–” are different from “—” but microsoft office auto-formats “–” to that.

I kinda assumed it was ALL microsoft word data that caused training to include that.

I am only now realizing AI stole from even the religious texts and influenced by them as well.

  • AbouBenAdhem@lemmy.world
    link
    fedilink
    English
    arrow-up
    33
    ·
    3 months ago

    Each dash has a different use case that all professionally-typeset books adhere to (not just religious texts).

    Hyphens are for compound words; en-dashes are for ranges or (on rare cases) to disambiguate multiple levels of hyphens; and em-dashes are for parenthetical dashes (for publishers who don’t use spaced en-dashes instead).

  • gedaliyah@lemmy.world
    link
    fedilink
    arrow-up
    27
    ·
    3 months ago

    Well, maybe a little. Em dashes and en dashes are pretty standard (and editorially enforced) in newspapers and academic journals. By length, every religious text is eclipsed by news and journal media on a daily basis.

  • A_norny_mousse@feddit.org
    link
    fedilink
    arrow-up
    1
    ·
    edit-2
    3 months ago

    Software converts human-typed comments to use fancy quotes, dashes and other punctuation. Even this platform does that with the Markdown extension Fancypants - look at the quotes in your post.

    That’s where LLMs get this from.

    E.g.: to get an em-dash here: --- => —