<html> <head> <base href="https://bugs.documentfoundation.org/"> </head> <body> <div> <a class="bz_bug_link bz_status_NEW " title="NEW - Using Unicode conversion on a combined emoji results in only partial conversion" href="https://bugs.documentfoundation.org/show_bug.cgi?id=138591#c4">Comment # 4</a> on <a class="bz_bug_link bz_status_NEW " title="NEW - Using Unicode conversion on a combined emoji results in only partial conversion" href="https://bugs.documentfoundation.org/show_bug.cgi?id=138591">bug 138591</a> from <a class="email" href="mailto:jluth@mail.com" title="Justin L <jluth@mail.com>"> Justin L</a> <pre>This might be VERY complex to do completely correctly, since there does not seem to be a single standard way of marking combining combinations of emojis. The latest version of the "Unicode Emoji" spec can be found at <a href="http://www.unicode.org/reports/tr51/">http://www.unicode.org/reports/tr51/</a>. Having glanced through the spec, I imagine adding some kind of logic like: if ( maInput.getLength() == 0 ) bIsEmojiSequence = isEmoji(); if ( isEmoji_modifier_base() ) bHaveEmoji_modifier_base = true; const nZWJ == fe0f; //Zero Width Joiner character if ( bIsEmojiSquence ) { if ( next == nZWJ || (isEmoji(next) && !bHaveEmoji_modifier_base) ) then continue to accept new characters. } It looks like this will require some low-level identification of emoji, since there is no classification yet such as ::com::sun::star::i18n::UnicodeType::EMOJI</pre> </div> <hr> You are receiving this mail because: <ul> <li>You are the assignee for the bug.</li> </ul> </body> </html>