2

I need to get just the text and math expressions (ignoring tables, pictures and styling) from a set of LaTeX documents and represent them in html.

Looks like plasTeX and MathJaX are enough for this task.

As I understand, after plasTeX has parsed the document, I would need to get all text nodes of the document and all nodes with math. For the math nodes I would try to preserve their LaTeX source. Is it possible, using plasTeX, to get LaTeX source of a math expression with all the commands already applied?

A Feldman
  • 3,930
  • Welcome to TeX.sx! Do you want everything removed which is inside a command or an environment, or is it just for some specific commands/environments (tables, etc)? – masu Oct 25 '13 at 14:20
  • Thanks! I want to get all math expressions from LaTeX document, and if the author created some new commands/environments in the document, I want to get them unfolded. – kseniyam Oct 25 '13 at 17:14
  • @masu , for example (the example is taken from http://en.wikibooks.org/wiki/LaTeX/Macros), if if a have in the document these lines \newcommand{\wbal}{The Wikibook about \LaTeX} This is ‘‘\wbal'' \ldots{} ‘‘\wbal'' I want to get them as just: This is ‘‘The Wikibook about \LaTeX''. – kseniyam Oct 25 '13 at 17:20

1 Answers1

-1

Yes, there is. Run Plastex with theme:minimal.

See answer to question: Is there a PlasTeX custom-renderer which can produce ePub (or nearly ePub)?

Echeban
  • 107
  • Can you expand on this for the particular situation here? Otherwise this is just a comment. – Andrew Swann Oct 06 '15 at 11:23
  • This should be fleshed out with a MWE showing the solution to the OP's question. Since it is the only answer, this is particularly important. – A Feldman Apr 30 '16 at 11:31