4WinMobile.com  

Go Back   4WinMobile.com > Windows Mobile - Third Party Support > uBook Official Support Forum

uBook Official Support Forum Support for uBook: µBook is a simple and lean, yet powerful ebook reader for Windows and Pocket PCs that can read HTML, TXT, RTF, PDB and PRC (not secure) ebook files. It can read directly from inside ZIP files, and supports BMP, GIF, PNG and JPG images. It offers many customization options including: Portrait and Landscape display, Choice of font type, color and size, etc.... It Also has a resizable/skinnable User Interface. Development site: http://www.gowerpoint.com/

Post New ThreadReply
 
LinkBack Thread Tools Search this Thread Display Modes
  #1 (permalink)  
Old 29-09-2008, 01:31 AM
 
Join Date: Sep 2008
Posts: 22
PDF to Html Converter

I've just spent a few days modifying the pdftohtml utility in poppler to produce Html output more suitable for uBook. The tool also now extracts all the images from a PDF document and hyperlinks them into the Html. You can find my version of the program here.

It's a C program and uses the poppler library, so it will probably only run on Linux. You should use the -noframes option when converting PDF files. And, like the original program, it doesn't cope well with multi-column PDF files.
Reply With Quote

Sponsors
  #2 (permalink)  
Old 06-10-2008, 06:28 PM
µBook's Avatar
VIP (Sponsor or Developer)
 
Join Date: Aug 2008
Location: Vancouver, BC, Canada
Posts: 98
Re: PDF to Html Converter

Awesome! I've been itching to delete the pseudo-PDF support from inside uBook
Can I add a link to your page on the uBook web site?
Thanks,
__________________
David 4 µBook
Reply With Quote
  #3 (permalink)  
Old 08-10-2008, 06:33 AM
 
Join Date: Sep 2008
Posts: 22
Re: PDF to Html Converter

Sure thing. I've provided the Poppler maintainers with a set of patches against the current branch, and I hope it gets integrated soon. Until then, point people at my pdftohtml website.
Reply With Quote
  #4 (permalink)  
Old 04-12-2008, 07:05 PM
 
Join Date: Dec 2008
Posts: 2
Re: PDF to Html Converter

From the sample conversion on your website, this program look like to be the best converter for generating ebook friendly files.

But how to install it and use it?

The Tarball don't include a readme and the manual page just explain the command line parameters.

can you explain to us how to install and run your utility?

What I've done so far :
With Ubuntu 8.04 32bit
I extracted the tarball in a folder, and tried the followings commands with no success :
./configure
make
sudo make install

I also tried to run pdftozip and to run it in terminal but nothing happen.
Reply With Quote
  #5 (permalink)  
Old 08-12-2008, 11:01 PM
 
Join Date: Sep 2008
Posts: 22
Re: PDF to Html Converter

Quote:
Originally Posted by faenad View Post
Can you explain to us how to install and run your utility?
Yes, I was hoping that the patches would be picked up and put into the main Poppler tree, but no such luck yet. There is no ./configure, so you can simply edit the Makefile to find the location of the header and library files:

Code:
# Compile flags for FreeBSD 6
#CXXFLAGS= -O2 -Wall -I. -I/usr/local/include/poppler -I/usr/local/include
#CFLAGS= $(CXXFLAGS)
#LIBDIRS= -L/usr/local/lib

# Compile flags for Ubuntu Linux
CXXFLAGS= -O2 -Wall -I. -I/usr/include/poppler
CFLAGS= $(CXXFLAGS)
LIBDIRS=
Comment out the existing lines (as shown above) and add the ones for Linux. Then you should be able to run the "make" command to build the executable, and then move it by hand into a directory like /usr/bin.
Reply With Quote
Sponsors
  #6 (permalink)  
Old 06-03-2009, 03:20 AM
 
Join Date: Mar 2009
Posts: 1
Re: PDF to Html Converter

OK, I'm having some problems here. I think I managed to get it installed because I have the executable files (pdftohtml pdftozip). How do I check to make sure these are the correct files? When I run, eg

pdftohtml -v

I get the exact same output from your custom version as I do from the standard version. I have made sure that I am executing the one I just compiled rather than the standard one. Should yours output different version information?

It's a little difficult to figure out what is going wrong, but it looks like lines are still being treated as separate paragraphs. Sometimes lines are joined, but it looks like there is a tab at the join point. Also, there is no gap from one paragraph to the next, so it's basically unreadable. The command I'm running is:

pdftohtml -raw -reflow -noframes -enc Latin1

I think I've tried every combination of those options without success. Thanks for any help, this program seems very promising if I could just get it to work.
Reply With Quote
  #7 (permalink)  
Old 28-08-2009, 08:40 PM
 
Join Date: Aug 2009
Posts: 1
Re: PDF to Html Converter

Hi, I have read an article about pdf to html conversion at www.nobleatom.com. I hope it will be helpful for you. Take a look. Thanks.
Reply With Quote
Sponsors
Post New ThreadReply

Bookmarks

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off



Friends of 4WM
Spb


All times are GMT. The time now is 11:47 AM.

Sponsors
 


Design by: vBulletin Skins Zone
Powered by vBulletin® Version 3.8.5
Copyright ©2000 - 2010, Jelsoft Enterprises Ltd.
Content Relevant URLs by vBSEO 3.3.2
(c) Daron Brewood - www.4winmobile.com 2004-2010
Ad Management by RedTyger


Page generated in 0.12701 seconds with 9 queries