[Libreoffice-bugs] [Bug 48672] New: LibreOffice (and OpenOffice) does not provide a search by content

bugzilla-daemon at freedesktop.org
Fri Apr 13 22:55:22 CEST 2012


https://bugs.freedesktop.org/show_bug.cgi?id=48672

             Bug #: 48672
           Summary: LibreOffice (and OpenOffice) does not provide a search
                    by content
    Classification: Unclassified
           Product: LibreOffice
           Version: unspecified
          Platform: All
        OS/Version: Linux (All)
            Status: UNCONFIRMED
          Severity: normal
          Priority: medium
         Component: Libreoffice
        AssignedTo: libreoffice-bugs at lists.freedesktop.org
        ReportedBy: jtaller2006 at yahoo.com


Created attachment 59942
  --> https://bugs.freedesktop.org/attachment.cgi?id=59942
This document contains the text "abc". The search engine cannot find it by
content.

2012/04/13

This query:
why libreoffice does not search by content
Produces 235,000 hits.

For reference, please see:
http://en.libreofficeforum.org/node/1141
http://ask.libreoffice.org/question/1814/why-libre-does-not-provide-a-search-by-content

This is a problem of coordination between Linux and OpenOffice/LibreOffice.
I would like to propose to Linux that any new document format generated by
OpenOffice, LibreOffice, or WhateverFutureOffice come with a plugin that the
Linux search program can call. For example, Ubuntu has a program called "search
for files". It will not find documents such as .odt, .ott, or .docx by their
content, because these formats are nested: each document is a zip archive of
several files. Any new word processor with a new data format should be required
to provide a plugin that hooks into the search engine. Yes, I did find some
scripts that can look inside an .odt file, but those scripts are unusable when
you have 100,000 documents. (I do have that many.)
Suggestions:
While this could be a very complex problem if you want to search XML files by
tag, I am advocating a simple solution that will satisfy most users. The .odt,
.docx, and similar formats are collections of zipped files, one of which holds
the document data. In .odt/.ott that file is called content.xml (the equivalent
in .docx is word/document.xml). I would like you to unzip the .odt/.ott/.docx
(etc.) into a temp dir, search content.xml, and report f1.odt if f1.odt
contains the content we are looking for. This should be integrated with the
Linux search GUI so that I can click on the link to the file. If you can
isolate the section of content.xml that contains the user data, that would be
great. Alternatively, if you treat the whole of content.xml as user data, you
will get some false hits, but I am willing to live with that until you write
something that looks only at the user's data inside content.xml. The current
situation is so unacceptable that in the last year I started to use the .doc
format on my Ubuntu because I can then find my files. A sad story. Thank you
for listening.
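
The suggestion above could be sketched roughly as follows. This is only an
illustration, not an existing LibreOffice or Ubuntu plugin: the names
BODY_MEMBER, document_text, and find_documents are made up for this example,
and it assumes Python 3 with only the standard library. It reads the data
file straight out of the zip archive instead of unpacking to a temp dir, and
it keeps only the text between the XML tags, which approximates "the user's
data" and avoids false hits on tag and style names.

```python
import zipfile
import xml.etree.ElementTree as ET
from pathlib import Path

# Archive member that holds the document body, keyed by file extension.
# content.xml is the name given in the report for .odt/.ott;
# word/document.xml is the corresponding member in .docx files.
BODY_MEMBER = {
    ".odt": "content.xml",
    ".ott": "content.xml",
    ".docx": "word/document.xml",
}


def document_text(path, suffix):
    """Return the text nodes of the document body, with all XML tags dropped."""
    with zipfile.ZipFile(path) as archive:
        xml_bytes = archive.read(BODY_MEMBER[suffix])
    root = ET.fromstring(xml_bytes)
    # Joining without a separator keeps words intact when they are split
    # across formatting spans; the trade-off is that the end of one
    # paragraph runs directly into the start of the next.
    return "".join(root.itertext())


def find_documents(root_dir, needle):
    """Return the paths under root_dir whose body text contains needle.

    The comparison is case-insensitive. Files that are not readable zip
    archives, or that lack the expected body member, are skipped.
    """
    needle = needle.lower()
    hits = []
    for path in sorted(Path(root_dir).rglob("*")):
        suffix = path.suffix.lower()
        if suffix not in BODY_MEMBER or not path.is_file():
            continue
        try:
            text = document_text(path, suffix)
        except (zipfile.BadZipFile, KeyError, ET.ParseError, OSError):
            continue
        if needle in text.lower():
            hits.append(path)
    return hits
```

For example, find_documents("/home/user/Documents", "abc") would return the
paths of matching files, which a search GUI could then show as clickable
links. With 100,000 documents, opening every archive on each query would
probably be too slow, so a real plugin would more likely feed this text into
the search engine's index once per file.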

Attachment:
This document contains the text "abc". It will not be found by content on
Ubuntu/GNOME using the Ubuntu search engine.

-- 
Configure bugmail: https://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.


