Bug 66852 - FILESAVE: Converting html to doc or pdf using soffice command line doesn't convert embedded image
Summary: FILESAVE: Converting html to doc or pdf using soffice command line doesn't co...
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Writer (show other bugs)
Version: 4.0.3.3 release
Hardware: Other All
: high normal
Assignee: Not Assigned
QA Contact:
URL:
Whiteboard: BSA
Keywords:
Depends on:
Blocks:
 
Reported: 2013-07-12 12:07 UTC by Ranieri Mazili
Modified: 2014-03-17 15:56 UTC (History)
0 users

See Also:
i915 platform:
i915 features:


Attachments

Description Ranieri Mazili 2013-07-12 12:07:19 UTC
Hi, 

you can see more details, examples of how to reproduce and print screen here: http://ask.libreoffice.org/en/question/20111/converting-files-using-soffice-convert-to-with-embedded-images-html-to-doc/

Resuming, if you have an embedded image (base64 using the img tag) in the html and then try to convert this file to a doc or pdf using the command line: soffice -headless -convert-to doc test.html

the doc file generated doesn't have the embedded image and left just a blank space where the image should be.
Operating System: Windows 7
Version: 4.0.4.2 release
Comment 1 Owen Genat 2013-07-13 02:55:45 UTC
Set status to NEW, version to v4.0.3.3, and platform to All as I can confirm this under GNU/Linux Crunchbang 11 x86-64 running TDF/LO v4.0.3.3. For the sake of simplicity an example file can be found here:

http://www.akamine.com.br/blog/imagem_embutida.html
Comment 2 Benoit Lathiere 2013-11-06 16:16:39 UTC
I've got the same problem (no picture, just spaces in the document) when I convert from html to docx with images linked (e.g. <img src="http://ww.xx.yy/aze.png"/>).

Context : Gnu/Linux Debian 7.2 / LibreOffice 3.5.
I tested all theses filters, without success :
:writer_OOXML =invalid document*,
:"Office Open XML Text" =no image,
:"MS Word 2007 XML" =no image,
:"MS Word 97" = invalid document*.

*the file contains HTML text !

No problem to convert to .doc, .pdf or .odt.
Comment 3 Oleg 2014-03-14 12:59:15 UTC
Hello to everyone!
Converting html with embedded images still doesn't work.
LibreOffice 4.2.2 (release at the moment)
OS: Windows 7, GNU/Linux i686 i386

Converting doesn't work for both linked images (like  <img src="http://ww.xx.yy/aze.png"/>) and for base64 encoded images.
Everytime I get ton of symbols and broken images.
Are there any updates?
Comment 4 Clement 2014-03-17 15:56:30 UTC
Hi,

This really sounds like an important issue as it produces completely junk output and breaks potentially very important features from using LibreOffice. 

Any update? Any idea of a direction where to dig? 

Thanks in advance


Use of freedesktop.org services, including Bugzilla, is subject to our Code of Conduct. How we collect and use information is described in our Privacy Policy.