Today I am at SetApp stumbled across an exciting tool that could have helped me many times if only I had known: TextSoap from unmarked software. To put it simply, TextSoap is a text editor that offers a rich arsenal of functions for cleaning up text. And the cleaning itself can be designed freely or you can put together your desired "cleaning procedure" from existing routines. Would you like a few simple application examples? Here you are:
- Some authors forget to add a space after the colon after a colon in a list.
- Almost everyone has double or triple spaces in their texts.
- Every now and then I get texts from customers or authors who don't have a space after the comma or sometimes a space before the comma.
- In many cases, a hyphen is used instead of a dash in a text - although a dash would belong.
- A sequence of three dots is often typed as an ellipsis for quotations, but these should actually be entered as special characters (on Mac with ALT + PERIOD) called "ellipsis".
- Every now and then I need single quotes for text (the variant that only appears at the top), but in the client's Word document there are typographic quotes (where the first is at the bottom).
Chapter in this post:
- 1 My previous solution: Search and replace via PHP script or in Pages
- 2 Over 100 ready-made functions available
- 3 Very powerful: the ability to build cleansing functions yourself
- 4 The best: Batch File Cleaning
- 5 Conclusion: A must if there is a lot of “foreign text”.
- 6 Screenshots of the TextSoap settings
- 7 Similar posts
My previous solution: Search and replace via PHP script or in Pages
As you can see, if you often work with other people's texts, you get a lot of possible problems that you have to solve. You shouldn't actually tell it at all, but I had programmed a PHP script for this from time to time, which solved several tasks at the same time for me using search and replace. This is of course really cumbersome and really only worthwhile if you have a lot of texts that you then clean up by copying and pasting. But it is at least time-saving than in Word or Pages using the "Search and Replace" function to gradually fix the problem cases.
Over 100 ready-made functions available
The Mac software TextSoap, on the other hand, is a small marvel for this area of application, because it offers so many ready-made cleaning functions that hardly anything is left to be desired. There are even features that correct the spelling of product names from major tech companies. An “Iphone” then automatically becomes an “iPhone” – even if the Mac usually offers this as a correction when typing.
In addition to such corrections of words or punctuation, there are also many functions that affect the form of the text. For example, if you have an email that contains quotes from an earlier email, these are often preceded by square brackets “>”. There is also a function in TextSoap that cleans up such things. There are also functions that convert back and forth between DOS, Unix and Mac line breaks.
In order to briefly introduce a few more routines, I have put together a small list here, but it does not cover all the possibilities by far. You can, among other things:
- Get all URLs out of a text and then attach them as a list to the original text
- Sort the rows (ascending or descending)
- reverse the sequence of all words in the text
- Randomize all rows (randomize) - I will use this to determine the winners in my raffles in the future!
- Convert rich text to plain text
- Convert URLs in the text to hyperlinks
- Change all dates in a certain format (YYYY-MM-DD, DD-MM-YYYY or similar)
- convert a string into hexadecimal information
- set up a notification when a certain number of characters or words is reached
- swap multiple line breaks for one
- Convert text to “web friendly text”.
- Insert line breaks according to the specified number of characters (50, 60 or 70 characters)
- Convert bullet points to hyphens
Very powerful: the ability to build cleansing functions yourself
As you can see, there is a function for almost every eventuality. And for what does not yet have a function, you can retrofit it yourself in the "Custom Editor". It's relatively easy to do, since you can fall back on predefined functions in this area, which you can then adapt accordingly.
In addition to normal search-and-replace, there is also the option of cleaning up texts with regular expressions. I programmed a short function of my own here and then added some existing cleaning functions to the command. This complete set can then be freely named and released in the future with a click on the text in the editor.
The best: Batch File Cleaning
If you think it can hardly be better, then you will find another function that tops it all off: Batch File Cleaning. This allows you to apply your cleaning routine to a whole bunch of texts (format: plain text (txt) or rich text (rtf)). The hearts of many shop owners who have to revise hundreds of product texts to remove certain areas, change spellings or iron out errors are sure to beat faster. With the use of REGEX (regular expressions) there are no limits to the imagination.
I don't have to write a lot of texts at work, but I think there is a lot of potential for saving time in this area. Whenever it comes to masses of text, the batch processing should be a very helpful function.
Conclusion: A must for a lot of “foreign text”.
I would say that TextSoap is a tool for everyone who has to work with foreign texts every now and then (or more often). When I write blog posts myself, I still get by quite well and stick to my own ideas about typography. Every now and then I correct product names that I used to write like that, but there is usually not much to clean up.
In my opinion, TextSoap always scores when you have to deal with texts from other people a lot. Here the typographical events are often different from your own ideas and you have to correct certain little things over and over again. Not only are these time-consuming jobs, they are also incredibly boring.
In such cases, TextSoap definitely makes it much easier and faster to reach your goal. With a purchase price of approx. 48 EUR per license (for 2 Macs) the tool is not super cheap, but if you have to do with texts professionally, you will quickly have the price back through saved time inside. I have the tool in my SetApp subscription included, which is of course very practical as it doesn't cost me any additional money.
At this point maybe a reference to another helpful text tool called “Get PlainText“, which I have activated almost constantly in order to have formatted text in the clipboard converted directly into plain text.
A small drawback of TextSoap is perhaps the fact that the tool is not available in German and that with many functions, due to the English function names, you do not understand exactly what they do. To do this, you would have to know the English names for the typographical special characters or elements that are processed here. Nevertheless, the functions can of course be used in any language. Most of them are language-independent and there are only a few that correct, for example, common spelling mistakes in the English language. From my point of view, TextSoap is worth a recommendation despite this small restriction.
Screenshots of the TextSoap settings
For those who want to know a bit more about TextSoap's capabilities and settings options, I've attached a screenshot of each section of TextSoap Version 8's “Preferences” here. Maybe it will help someone to decide if the tool is the right one for their own - often very specific - need.
Related Articles
Jens has been running the blog since 2012. He acts as Sir Apfelot for his readers and helps them with technical problems. In his spare time he rides electric unicycles, takes photos (preferably with the iPhone, of course), climbs around in the Hessian mountains or hikes with the family. His articles deal with Apple products, news from the world of drones or solutions to current bugs.
I tried it this morning and I'm thrilled. Works in all of my typing and email programs. The time saved is really enormous. Above all, the possibility of creating my own workflow, which I can then run over my texts with one click, is simply great. Thanks for the tip.
I'm glad it helps you! I'm surprised myself that I've only just discovered the program. : D