I would like to address an issue here that matters a lot to me that is: Why do the arXiv admins not consider to introduce an option to submit XeLaTeX generated PDFs? Or in a broader sense: non pdflatex generated documents.
A reasonable argument I can think of is that they might be afraid of problems arising from licensing issues (concerning the fonts used). But that is a problem that could be solved.
I noticed that the support team puts an enormous effort in detecting if a PDF document was typeset with any TeX typesetting engines and eventually blocking it from being uploaded, also the sources as long as it is not compatible with the (pdf)latex compiler, although people (especially linguists see below) are asking for this for years now. So why not allowing researchers to upload their PDF?
There are a vast number of significant advantages using not the old-fashioned (pdf)LaTeX typesetting engines:
- most notable to me: a proper unicode support, i.e. proper UTF-8 support out of the box
- OpenType, TrueType font support (Multiple Master Fonts and other modern font technologies like Graphite and AAT) via the fontspec package
- ligatures and contextual alternates
- in addition to (1): Proper support for Japanese, Chinese, Korean etc.
- polyglossia let’s us effortlessly change between different languages, even variants of them, within the document.
The first five are mainly typesetting arguments, so they are also arguments for researchers, namely linguists, that often have to readapt their documents to XeLaTeX for publishing, but, on the other hand, are compelled to submit their work to the heavily used platform to be perceived by a broader audience. I really dislike to say, but I have the slight impression that historically TeX was written from a US centred perspective: Writing some Eastern European or Turkish names correctly can be quite challenging, e.g. try Nikodyḿ, Žižek or Aydoğmuş, Nallıhan.
But also for me, coming from a mathematical background, there are huge advantages, namely:
The unicode-math package:
- Improved readability options, especially in math mode: Using
\symnormal, \symliteral, \symup, \symbfup, \symbfitetc. I can easily manipulate a greek letter to be displayed bold, upshape, sans serif bold italic or whatever I like. Very helpful to distinguish, e.g., an arbitrary δ from the codifferential. It’s a pain in the ass (please excuse my language here) to achieve the same output with pdflatex. E.g. for the famous indicator function, you can just type “$\symbb 1$”.
- Improved readability options, especially in math mode: Using
Out of the box support for the vast majority of symbols used in present mathematics, here is complete a list.
- For example, including
\Vbarfor independence in probability theory,\smalltrianglerightto denote a left action, the musical isomorphisms\flatand\sharp,\subsetcircfor open subset, all dice faces, etc. etc.
- For example, including
- More math fonts and options how to use them available: different fonts for different symbols can be chosen, the size of single objects can be set independently etc.
Some completely unnecessary crooks often help to somehow implement all the features mentioned in 1.-8. The question for me is: Why? It is like using obsolete/outdated version of programming language, say C++, because you do not "like" Lambda Expressions.
- I will not go to hard on this, because I am not an expert, but for many people the possibilities of LuaLaTeX should be also mentioned.
A claim often made is that publishers also use their own template, so people will have to adapt eventually. But (1) why shall I not be allowed to format my work as it fits my needs and aesthetic perception? Especially considering that not using arXiv is not an option anymore. (2) Then you are still left with some disciplines, as I mentioned linguists or philologist, that rely on a proper unicode support.
Edit: Tried to clear up that I am mainly concerned with the generated XeLaTeX PDF files not being able to upload the sources.