Skip to main content

Convert Word Document to Other File Formats (Pdf, Image, Html, Txt etc.) in Java

Convert Word document to other file formats is undoubtedly one of the most common requirements when we working with Word document. In this article, we’ll learn how to convert Word document to other file formats programmatically in Java by using Free Spire.Doc for Java library.

Free Spire.Doc for Java library let us convert Word document to a series of file formats easily and effectively without having Microsoft Office to be installed on our system. The following conversions are supported by Free Spire.Doc for Java:
  • Microsoft Word file format conversions
  • Convert Word to Pdf
  • Convert Word to Image
  • Convert Word to Html
  • Convert Word to Rtf
  • Convert Word to Svg
  • Convert Word to Txt
  • Convert Word to Xps
  • Convert Word to Pcl
  • Convert Word to Post Script
  • Convert Word to Odt
  • Convert Word to Epub
  • Convert Word to WordML/WordXML

In the following examples, we’ll see how to convert Word document to Pdf, Image, Html and Svg.

Convert Word to Pdf

The following example demonstrates how to convert a Word document to Pdf file format.
public class ConvertWordToPDF {
   
public static void main(String[] args){
        Document doc =
new Document("Input.docx");

        doc.saveToFile(
"Output.pdf", FileFormat.PDF);
   
}
}
Convert Word to PDF

Free Spire.Doc for Java also allows us to convert Word to Pdf with custom settings, such as whether to embed fonts, whether to disable link, as shown in below code.
public class ConvertWordToPDF {
   
public static void main(String[] args){
        Document doc =
new Document("Input.docx");

       
ToPdfParameterList parameterList = new ToPdfParameterList();
        parameterList.isEmbeddedAllFonts(
true);
        parameterList.setDisableLink(
true);

        doc.saveToFile(
"Output.pdf", parameterList);
   
}
}

Convert Word to Image

Free Spire.Doc for Java supports converting a specific page of a Word document to an image as well as converting the whole document to images. The following example shows how to convert the first page of a Word document to image.
public class ConvertWordToImage {
   
public static void main(String[] args){
        Document doc =
new Document("Input.docx");

        BufferedImage image= doc.saveToImages(
0, ImageType.Bitmap);

        File file=
new File("Output.png");
       
try {
            ImageIO.write(image,
"PNG", file);
        }
catch (IOException e) {
            e.printStackTrace();
        }
    }
}
Convert Word to Image
  
Convert Word to Html

The following example shows how to convert a Word document to Html file format.
 public class ConvertWordToHTML {
   
public static void main(String[] args){
        Document doc =
new Document("Input.docx");

        doc.saveToFile("Output.html", FileFormat.Html);
   
}
}
Convert Word to Html
 
Convert Word to Svg

The following example illustrates how to convert a Word document to Svg file format.
public class ConvertWordToPDF {
   
public static void main(String[] args){
        Document doc =
new Document("Input.docx");       

       
doc.saveToFile("Output.svg", FileFormat.Svg);
   
}
}
Convert Word to Svg

Note: The free version is limited to 3 pages of conversion, if you want to convert more pages, you'll need to upgrade to the professional version of Spire.Doc for Java 

More information

Comments

Popular posts from this blog

3 Ways to Generate Word Documents from Templates in Java

A template is a document with pre-applied formatting like styles, tabs, line spacing and so on. You can quickly generate a batch of documents with the same structure based on the template. In this article, I am going to show you the different ways to generate Word documents from templates programmatically in Java using Free Spire.Doc for Java library. Prerequisite First of all, you need to add needed dependencies for including Free Spire.Doc for Java into your Java project. There are two ways to do that. If you use maven, you need to add the following code to your project’s pom.xml file. <repositories>               <repository>                   <id>com.e-iceblue</id>                   <name>e-iceblue</name> ...

Insert and Extract OLE objects in Word in Java

You can use OLE (Object Linking and Embedding) to include content from other programs, such as another Word document, an Excel or PowerPoint document to an existing Word document. This article demonstrates how to insert and extract embedded OLE objects in a Word document in Java by using Free Spire.Doc for Java API.   Add dependencies First of all, you need to add needed dependencies for including Free Spire.Doc for Java into your Java project. There are two ways to do that. If you use maven, you need to add the following code to your project’s pom.xml file.     <repositories>               <repository>                   <id>com.e-iceblue</id>                   <name>e-iceblue</name>     ...

Remove Duplicate Rows in Excel in C# and VB.NET

When an Excel file contains a huge amount of records, there might be duplicate records as well. In this blog, I am going to show you how to remove the duplicate rows in an Excel file programmatically in C# and VB.NET. The library I used: Free Spire.XLS for .NET Free Spire.XLS for .NET is a feature-rich Excel API offered by E-iceblue. It can be easily integrated in your .NET (C#, VB.NET, ASP.NET, .NET Core) applications to create, read, edit, convert and print Excel files without using Microsoft Office. Before coding, you need to get Free Spire.XLS for .NET by installing it via NuGet or downloading it via the official website . C# Code using  Spire.Xls;   using  System.Linq;      namespace  RemoveDuplicateRows   {        class  Program       {            static   void  Main( string [] args) ...