[Bug 753480] matroskademux: text/x-raw subtitle tracks ouputs are escaped

GStreamer (GNOME Bugzilla) bugzilla at gnome.org
Fri Dec 18 08:55:29 PST 2015


https://bugzilla.gnome.org/show_bug.cgi?id=753480

--- Comment #5 from Pierre Lamot <pierre.lamot at yahoo.fr> ---
When the sutitle codec is S_TEXT/UTF8, the demuxer has no way to guess the
format. So matroskademux assume its in pango. when the line contains a markup
tag (<b>,<i> .. <span),

I can see different cases:

- The text is actual pango markup. after the first markup is encounter the text
  is returned as is (which is the expected behavior). Before, the text is
  escaped, this might cause issues with escaped characters: for instance
  "&" will be re-encoded as &amp;

- The text is srt (which is stored as S_TEXT/UTF8 [1]), srt markup is close to
  pango, the main difference is that <font> is used instead of <span> and that
  characters doesn't require to be escaped (but player support may vary), so
you
  can have things like actual "&", escaped character "&", even single "<"
or
  ">"

  Though the srt format doesn't seems to be properly defined. So I guess that
  decoding is kind of "best effort"

- The text is actual plain text. As our caps is set to pango, we have to escape
  it as it will be interpreted as pango.


So I guess the current solution is best solution t provide *pango*
markup. However, I think that ideally we could provide either raw (utf-8) or
pango
format using caps negotiation. But the current caps "selection" used fixed
format caps, so this might require some workaround for this case.

[1] http://www.matroska.org/technical/specs/subtitles/srt.html

-- 
You are receiving this mail because:
You are the QA Contact for the bug.
You are the assignee for the bug.


More information about the gstreamer-bugs mailing list