mailing-list for TeXmacs Users

Text archives Help


Re: Fast conversion of TeXmacs files to plain text


Chronological Thread 
  • From: Sam Liddicott <address@hidden>
  • To: Ingolf Schäfer <address@hidden>
  • Cc: TeXmacs user list <address@hidden>
  • Subject: Re: Fast conversion of TeXmacs files to plain text
  • Date: Tue, 4 Feb 2020 13:14:48 +0000

It ought to be possible to batch convert a bunch of files from one invocation of texmacs, by using a sequence of command line arguments to pass texmacs commands

But you do need to invoke the full texmacs because it's impossible otherwise to know how various macros might render as text.

Sam

On Tue, 4 Feb 2020, 00:22 Ingolf Schäfer, <address@hidden> wrote:
Hi,

I have been playing around with Recoll a bit. Recoll is an indexing software that allows to search indexed files fast on multiple platforms. While it is trivially simple to just treat TeXmacs files like ordinary text files, that is not the most appropriate way to index them.

The most natural candidate for a "detexmacs" command that strips formatting data and such out of the text would be to call texmacs with the -c and -q option.

Unfortunately, that process is rather slow and takes a couple of seconds for each file, even if it is just a couple of lines. texmacs -q still does initialization, starting the GUI window, converting  and closing the window again.

While this is kind of ok, when you have only a couple of documents, I'd rather have a quicker and probably more light weight way to do that conversion.

Has someone worked in that direction already or a simple tip?

Thanks in advance,

Ingolf




Archive powered by MHonArc 2.6.19.

Top of page