<html>
<head>
<base href="https://bugs.documentfoundation.org/">
</head>
<body>
<p>
<div>
<b><a class="bz_bug_link
bz_status_NEW "
title="NEW - Using Unicode conversion on a combined emoji results in only partial conversion"
href="https://bugs.documentfoundation.org/show_bug.cgi?id=138591#c4">Comment # 4</a>
on <a class="bz_bug_link
bz_status_NEW "
title="NEW - Using Unicode conversion on a combined emoji results in only partial conversion"
href="https://bugs.documentfoundation.org/show_bug.cgi?id=138591">bug 138591</a>
from <span class="vcard"><a class="email" href="mailto:jluth@mail.com" title="Justin L <jluth@mail.com>"> <span class="fn">Justin L</span></a>
</span></b>
<pre>This might be VERY complex to do completely correctly, since there does not
seem to be a single standard way of marking combining combinations of emojis.
The latest version of the "Unicode Emoji" spec can be found at
<a href="http://www.unicode.org/reports/tr51/">http://www.unicode.org/reports/tr51/</a>.
Having glanced through the spec, I imagine adding some kind of logic like:
if ( maInput.getLength() == 0 )
bIsEmojiSequence = isEmoji();
if ( isEmoji_modifier_base() )
bHaveEmoji_modifier_base = true;
const nZWJ == fe0f; //Zero Width Joiner character
if ( bIsEmojiSquence )
{
if ( next == nZWJ || (isEmoji(next) && !bHaveEmoji_modifier_base) )
then continue to accept new characters.
}
It looks like this will require some low-level identification of emoji, since
there is no classification yet such as
::com::sun::star::i18n::UnicodeType::EMOJI</pre>
</div>
</p>
<hr>
<span>You are receiving this mail because:</span>
<ul>
<li>You are the assignee for the bug.</li>
</ul>
</body>
</html>