• Blue_Morpho@lemmy.world
    4 months ago

    For instance, you can’t write a regex that’ll reliably find the subject, object and verb in any English sentence

    Identifying parts of speech isn’t a requirement of the word “parse”. That’s the linguistic definition. In computer science, identifying tokens is parsing.

    https://en.m.wikipedia.org/wiki/Parsing

    • notabot@lemm.ee
      4 months ago

      That’s certainly one level of parsing, and sometimes all you need, but as the article you posted says, it more usually refers to generating a parse tree. Doing that for a natural language isn’t happening with a regex.
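
To make the two levels concrete, here is a minimal sketch in Python. It is not from the linked article; the toy grammar (integers, `+`, `*`, parentheses) and the names `tokenize` and `parse` are assumptions chosen for illustration. The regex is enough to find tokens, but the nesting has to be tracked by something with memory, here an explicit stack.

```python
import re

# Hypothetical toy grammar: integers, '+', '*', and parentheses.
TOKEN_RE = re.compile(r"\s*(\d+|[()+*])")


def tokenize(text):
    """Lexing step: a regex is enough to split the input into flat tokens."""
    text = text.rstrip()
    tokens = []
    pos = 0
    while pos < len(text):
        match = TOKEN_RE.match(text, pos)
        if match is None:
            raise ValueError(f"unexpected character at position {pos}")
        tokens.append(match.group(1))
        pos = match.end()
    return tokens


def parse(tokens):
    """Tree-building step: a stack records the nesting the regex cannot."""
    root = []
    stack = [root]
    for tok in tokens:
        if tok == "(":
            child = []
            stack[-1].append(child)  # attach the new group to its parent
            stack.append(child)      # and descend into it
        elif tok == ")":
            if len(stack) == 1:
                raise ValueError("unbalanced ')'")
            stack.pop()              # climb back to the parent group
        else:
            stack[-1].append(tok)
    if len(stack) != 1:
        raise ValueError("unbalanced '('")
    return root


tokens = tokenize("1 + (2 * (3 + 4))")
tree = parse(tokens)
```

Here `tokens` is a flat list, while `tree` is a nested list whose depth mirrors the parentheses. The second step is the part that usually gets called parsing in the parse-tree sense.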

      • uranibaba@lemmy.world
        4 months ago

        Thanks for all the explaining. Ever since I first saw the Stack Overflow post, I have wondered why you can’t parse HTML with a regex, when you can take any HTML you find and write an expression that works against that particular set of data.

        I never understood the word parse to mean understanding and building a structure based on any input.
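
A rough sketch of that difference, assuming Python and its standard-library `html.parser` module. The sample markup and the `TreeBuilder` class are made up for illustration. The regex works on the specific text it was written against, but it has no notion of nesting, so it pairs the outer opening tag with the inner closing tag.

```python
import re
from html.parser import HTMLParser

# Hypothetical sample input with one div nested inside another.
html = "<div>outer <div>inner</div> tail</div>"

# The non-greedy pattern stops at the FIRST closing tag it sees,
# so the captured text cuts the outer div short.
regex_match = re.search(r"<div>(.*?)</div>", html).group(1)


class TreeBuilder(HTMLParser):
    """Builds a nested dict structure by tracking open tags on a stack.

    Simplification: void elements such as <br> are not special-cased.
    """

    def __init__(self):
        super().__init__()
        self.root = {"tag": "root", "children": []}
        self.stack = [self.root]

    def handle_starttag(self, tag, attrs):
        node = {"tag": tag, "children": []}
        self.stack[-1]["children"].append(node)
        self.stack.append(node)

    def handle_endtag(self, tag):
        if len(self.stack) > 1:
            self.stack.pop()

    def handle_data(self, data):
        self.stack[-1]["children"].append(data)


builder = TreeBuilder()
builder.feed(html)
builder.close()
tree = builder.root
```

`regex_match` ends up as `outer <div>inner`, which is not the content of either div, while `tree` holds the outer div with the inner div as one of its children.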