[Clipart] [Bug 19449] New: Ridiculous number of duplicates
bugzilla-daemon at freedesktop.org
Wed Jan 7 13:43:22 PST 2009
http://bugs.freedesktop.org/show_bug.cgi?id=19449
Summary: Ridiculous number of duplicates
Product: openclipart.org
Version: unspecified
Platform: Other
OS/Version: All
Status: NEW
Severity: normal
Priority: medium
Component: clipart
AssignedTo: clipart at lists.freedesktop.org
ReportedBy: jhansonxi at gmail.com
Created an attachment (id=21767)
--> (http://bugs.freedesktop.org/attachment.cgi?id=21767)
Listing of duplicate files
I installed the Windows version of the library and noticed a lot of duplicates
when adding them to the OpenOffice.org Gallery. I then downloaded
openclipart-0.18-full.tar.bz2 and did a duplicate check (on Ubuntu) and found
thousands. The commands I used are below and were run from the clipart
directory:
find . -type f -exec md5sum '{}' \; >md5_listing.txt
sort md5_listing.txt | uniq -d -w32 | cut -c 1-32 >md5_duplicates.txt
grep -f md5_duplicates.txt md5_listing.txt | sort >duplicates_listing.txt
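For anyone who wants to reproduce the check, the three steps above can probably be collapsed into one pipeline. This is only a rough sketch, assuming GNU findutils and coreutils (as on Ubuntu); list_duplicates is a made-up helper name, not something shipped with the library:

```shell
# Sketch: list files with identical contents, grouped by MD5 checksum.
# list_duplicates is a hypothetical helper; it assumes GNU find, md5sum, sort and uniq.
list_duplicates() {
    # Hash every regular file under the given directory (default: current one),
    # passing many file names to each md5sum call instead of one call per file.
    # Sorting puts equal checksums on adjacent lines. uniq then compares only the
    # first 32 characters (the MD5 hex digest) and prints every member of each
    # duplicate group, with a blank line between groups.
    find "${1:-.}" -type f -exec md5sum {} + \
        | sort \
        | uniq -w32 --all-repeated=separate
}
```

Running something like `list_duplicates . >/tmp/duplicates_listing.txt` from the clipart directory should give a similar listing; writing the output outside the scanned tree keeps the listing file itself from being hashed.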
My report is attached.
--
Configure bugmail: http://bugs.freedesktop.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.