I am trying to retrieve as many instances as possible of a demonstrative (DEM) in my corpus. The DEM may have several orthographic representations, such as “ina”, “inad”, “ena”, and “enad” (all possibly followed by a comma or whitespace), for which I am using the following regex:
(i|e)na(?:,|\s|d)
When I run the search, many of the results are matched correctly, but not all of them. For example, in the screenshot below, segment 492 is returned as a result (as it contains the string “ina”), but segments 494 and 495, which contain the strings “ina” and “ina,” respectively, are ignored.
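
To make the pattern's behavior concrete, here is a minimal Python sketch run on made-up strings (they are not taken from the corpus, and the search tool may use a different regex flavor than Python's `re`). It shows that the pattern always needs one more character after "ina"/"ena", so a form at the very end of a segment, or one followed by a period, is not matched. The `BOUNDED` pattern is only a hypothetical alternative for comparison, not a confirmed fix.

```python
import re

# Pattern from the post: "ina"/"ena" must be FOLLOWED by ",", whitespace, or "d".
ORIGINAL = re.compile(r"(i|e)na(?:,|\s|d)")

# Hypothetical alternative (an assumption, not from the post):
# optional "d", delimited by word boundaries instead of a required next character.
BOUNDED = re.compile(r"\b[ie]nad?\b")

def hits(pattern, text):
    """Return True if the pattern matches anywhere in the text."""
    return pattern.search(text) is not None

# Made-up test strings covering the cases discussed above.
samples = ["ina", "ina,", "ina kitab", "inad", "ena.", "Ina,", "china "]

for s in samples:
    print(f"{s!r:14} original={hits(ORIGINAL, s)!s:6} bounded={hits(BOUNDED, s)}")
```

In this sketch the original pattern misses a bare "ina" at the end of a string and "ena." before a period, and it also matches inside a longer word such as "china ". Both patterns are case-sensitive, so "Ina," is missed unless a flag such as `re.IGNORECASE` is used. This does not by itself explain why a segment containing "ina," would be skipped; capitalization or a non-ASCII comma character are possible causes worth checking.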

Are there any solutions to this issue? Thanks in advance.
