Can you really automate that? You could provide a "split at glyphs" function, but I doubt it would actually be useful without tons of caveats. Even English doesn't split cleanly at glyphs, given the existence of ligatures: `flat -> fl a t`.
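To make the ligature point concrete, here is a minimal Python sketch. It assumes the word was typed with the precomposed ligature code point U+FB02, which is only one way such text can show up (fonts can also form the ligature at render time from a plain `f` and `l`).

```python
import unicodedata

# "flat" written with the single code point U+FB02 (LATIN SMALL LIGATURE FL)
word = "\ufb02at"

# Splitting by code point keeps the ligature as one unit: three items.
by_code_point = list(word)
print(len(by_code_point))

# Compatibility normalization (NFKC) expands the ligature into "f" + "l".
expanded = unicodedata.normalize("NFKC", word)
print(list(expanded))  # ['f', 'l', 'a', 't']
```

So the "right" split depends on whether you count the ligature as one glyph or two letters, and the string alone doesn't tell you which the caller wants.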
Not to mention you would need to make any such function language-aware, since different languages could have different mapping rules for the same sequence of characters.
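A rough sketch of what "language-aware" would mean in practice, in Python. The `DIGRAPHS` table and `split_letters` function are hypothetical names made up for illustration, and the table is deliberately tiny: it only records that Czech treats `ch` as a single letter.

```python
# Hypothetical lookup table: character sequences that a given language
# treats as one letter. A real table would be much larger and messier.
DIGRAPHS = {
    "cs": ("ch",),  # Czech: "ch" is a single letter of the alphabet
    "en": (),       # English: nothing special in this sketch
}

def split_letters(text, lang):
    """Greedily split text into letters using the per-language table."""
    # Try longer units first so a longer match wins over a shorter one.
    units = sorted(DIGRAPHS.get(lang, ()), key=len, reverse=True)
    out = []
    i = 0
    while i < len(text):
        for unit in units:
            if text.startswith(unit, i):
                out.append(unit)
                i += len(unit)
                break
        else:
            # No multi-character unit matched here: take one character.
            out.append(text[i])
            i += 1
    return out

print(split_letters("chata", "cs"))  # ['ch', 'a', 't', 'a']
print(split_letters("chata", "en"))  # ['c', 'h', 'a', 't', 'a']
```

Same input, two different answers depending on the language tag. And this still ignores case, combining marks, and places where the two characters just happen to sit next to each other, which is where the caveats start piling up.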