Java find and replace text on Word document
E-iceblue Product Family
Posted on April 29, 2019
On Microsoft Word, we can find special text string and replace them with new text strings easily. This article will mention various approaches of the finding and replacing feature on the word
document by using Java API Spire.Doc independently (without using any third-party code) as below.
- Find and replace text string in Word with new text string
- Find and replace Text by Regular Expressions
- Find and replace Text with HTML
Find and replace text string in Word with new text string. All the searched results will be detected and replaced at one time.
import com.spire.doc.*;
import com.spire.doc.documents.TextSelection;
import java.awt.*;
public class Word {
public static void main(String[] args) {
String input = "replaceWithText.docx";
String output = "out/replaceWithText.docx";
//load Word document
Document document = new Document();
document.loadFromFile(input, FileFormat.Docx);
//replace the text
document.replace("Word", "NewWord", false, true);
//highlight the replaced text
TextSelection[] textSelections = document.findAllString("NewWord", false, true);
//set highlight
for (TextSelection selection : textSelections) {
selection.getAsOneRange().getCharacterFormat().setHighlightColor(Color.YELLOW);
}
//save the document
document.saveToFile(output,FileFormat.Docx);
}
}
Find the words that match a specific regular expression in a Word document and replace the matched words with a new string. The following sample will show you how to replace the regular expression
for the words that start with”#”.
import com.spire.doc.*;
import java.util.regex.Pattern;
import com.spire.doc.documents.TextSelection;
import java.awt.*;
public class Word {
public static void main(String[] args) {
String input = "replaceTextByRegex.docx";
String output = "out/replaceTextByRegex.docx";
//load Word document
Document document = new Document();
document.loadFromFile(input, FileFormat.Docx);
//create a regex, match the text that starts with "#"
String regEx="\\#\\w+\\b";
Pattern pattern= Pattern.compile(regEx);
//replace the text by regex
document.replace(pattern,"JavaSpire.Doc");
//highlight the replaced text
TextSelection[] textSelections = document.findAllString("JavaSpire.Doc", false, true);
//set highlight
for (TextSelection selection : textSelections) {
selection.getAsOneRange().getCharacterFormat().setHighlightColor(Color.YELLOW);
}
//save the document
document.saveToFile(output,FileFormat.Docx);
}
}
Find and replace text in word with HTML.
import com.spire.doc.*;
import com.spire.doc.documents.*;
import com.spire.doc.fields.TextRange;;
import java.io.*;
public class Word{
public static void main(String[] args) throws Exception{
String input = "replaceWithHtml.docx";
String inputHtml = "replaceWithHtml.txt";
String output = "out/replaceWithHtml.docx";
//load Word document
Document document = new Document();
document.loadFromFile(input, FileFormat.Docx);
//find a paragraph and replace its content with html
TextSelection textSelection = document.findString("[#placeholder]", false, true);
TextRange textrange =textSelection.getAsOneRange();
Paragraph par= textrange.getOwnerParagraph();
String htmlText = readTextFromFile(inputHtml);
par.appendHTML(htmlText);
par.getChildObjects().removeAt(0);
//save the document
document.saveToFile(output,FileFormat.Docx);
}
public static String readTextFromFile(String fileName) throws IOException{
StringBuffer sb = new StringBuffer();
BufferedReader br = new BufferedReader(new FileReader(fileName));
String content = null;
while ((content = br.readLine()) != null) {
sb.append(content);
}
return sb.toString();
}
}
Conclusion
It is quite easy for us to use Spire.Doc to find and replace the text on the word document with new text strings and HTML in Java applications. Thanks for your reading and wish it helps.
Posted on April 29, 2019
Join Our Newsletter. No Spam, Only the good stuff.
Sign up to receive the latest update from our blog.