Questions tagged [wikipedia]

Wikipedia is a collaboratively edited, multilingual, free Internet encyclopedia supported by the non-profit Wikimedia Foundation.

Wikipedia's 30 million articles in 286 languages, including over 4.2 million in the English Wikipedia, are written collaboratively by volunteers around the world. Almost all of its articles can be edited by anyone who has access to the site and is not blocked. It has become the largest and most popular general reference work on the Internet, ranking sixth globally among all websites on Alexa, with an estimated 365 million readers worldwide.

59 questions
10
votes
1 answer

Download Wikipedia articles from a specific category

I know that I can download the English Wikipedia dump, but I was wondering if I can download only the articles for a specific category or subject. For instance, can I download articles related to Mathematics or Biology or Medicine only? If this is not…
Sfinos
  • 305
  • 3
  • 6
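
One possible starting point for the question above, offered as a sketch rather than a confirmed answer: the MediaWiki Action API has a `list=categorymembers` query that lists the pages in a single category. The helper name `category_members_url` and the example category are illustrative assumptions. The code only builds the request URL; it does not fetch anything and does not recurse into subcategories.

```python
from urllib.parse import urlencode

# English Wikipedia API endpoint; other language editions use their own host.
API_ENDPOINT = "https://en.wikipedia.org/w/api.php"


def category_members_url(category, limit=50, continue_token=None):
    """Build a categorymembers query URL (hypothetical helper, no network access)."""
    params = {
        "action": "query",
        "list": "categorymembers",
        "cmtitle": f"Category:{category}",
        "cmlimit": limit,
        "format": "json",
    }
    # The API returns a continuation token when more results are available.
    if continue_token:
        params["cmcontinue"] = continue_token
    return f"{API_ENDPOINT}?{urlencode(params)}"


print(category_members_url("Mathematics"))
```

Each response lists page IDs and titles for one batch of members; passing the returned `cmcontinue` value back in requests the next batch. For very large categories, working from the dumps is likely to be kinder to the servers than many API calls.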
9
votes
3 answers

How to get daily updates from Wikipedia?

I installed MediaWiki on an EC2 machine and loaded the Wikipedia pages-articles dump and other relevant data into it. I want to update the data regularly (daily), but I couldn't find any resource for that. Is there any source from where I can get…
vinod
  • 91
  • 3
4
votes
1 answer

Geotagged wiki data

For research purposes, how can I extract geotagged wiki data (page IDs or titles of articles that refer to a particular geolocation) for a city in England?
naw16
  • 41
  • 1
3
votes
2 answers

Download Wikipedia dump and save in raw text form

I have been trying to use Wikipedia text data for my personal research. I know that crawling is not good for the Wikipedia servers, so I downloaded a big XML file from https://dumps.wikimedia.org/jawiki/latest/; specifically, I downloaded 3 files…
Jin Sakuma
  • 31
  • 2
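
A minimal sketch of one way to approach the question above, assuming the dump follows the usual MediaWiki XML export layout (`<page>` elements containing a `<title>` and a `<revision>` with a `<text>` element). The names `iter_pages`, `local_name`, and `SAMPLE` are illustrative, and the tiny in-memory document stands in for a real dump file. The extracted text is still wikitext markup, so turning it into plain text would need a further step.

```python
import io
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the XML namespace, e.g. '{http://...}page' -> 'page'."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(source):
    """Stream (title, wikitext) pairs from a MediaWiki XML export."""
    title, text = None, ""
    for _event, elem in ET.iterparse(source, events=("end",)):
        name = local_name(elem.tag)
        if name == "title":
            title = elem.text
        elif name == "text":
            text = elem.text or ""
        elif name == "page":
            yield title, text
            title, text = None, ""
            # Drop the finished page's children so memory use stays modest.
            elem.clear()


# Tiny stand-in for a real dump (assumed layout, not real data).
SAMPLE = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">
  <page>
    <title>Example</title>
    <revision><text>Some ''wikitext'' here.</text></revision>
  </page>
</mediawiki>"""

for page_title, wikitext in iter_pages(io.BytesIO(SAMPLE)):
    print(page_title, len(wikitext))
```

For a compressed dump, the standard library's `bz2.open(path, "rb")` returns a file object that could be passed to `iter_pages` in place of the `io.BytesIO` wrapper, which avoids decompressing the whole file to disk first.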
3
votes
0 answers

How are Wikipedia subcategories meant, semantically?

Let's have a look at the category https://en.wikipedia.org/wiki/Category:British_politicians. Theresa May is not in it. So let's see: Theresa May is in Category:20th-century_British_women_politicians, which is in…
2
votes
1 answer

Wikipedia database: categories and category mapping across languages

I've imported the Wikipedia database in four languages with the goal of running some machine learning algorithms on it for text classification. The import doesn't populate the "category" table, though. Am I missing something? I would also like to…
podzway
  • 23
  • 2
2
votes
2 answers

Where can I find the source code of Wikipedia / Wiktionary templates?

Wikipedia and its sister sites make heavy use of templates. I want to find the source code behind those templates, i.e. the code that renders the HTML from a given template reference. Looking here, I found a few files which seem to deal with…
1
vote
1 answer

How to use multiple Wikipedia categories for Quick Intersection?

I'm using http://tools.wmflabs.org/quick-intersection/index.php to get a list of articles within categories, but I have a list of over 35 categories and I'd like to use Quick Intersection to get the articles. The problem I have found, and perhaps I just…
Bart
  • 13
  • 2
1
vote
2 answers

Converting Wikipedia URL to Wikipedia Page ID

I've linked phrases in texts to entities in Wikipedia: "Going over the bridge, coming from Aliante Casino, you cant miss the nice view of the <a href="http://en.wikipedia.org/wiki/Waterfall">waterfall</a> that is at the forefron" Now, I would like to…
dzieciou
  • 233
  • 1
  • 8
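
One way to approach the question above, sketched under assumptions: the article title can be read from the `/wiki/` part of the URL, and the MediaWiki Action API (`action=query` with `titles=`) returns a `pages` object keyed by page ID. The function names are illustrative, the code performs no network request, and the page ID `123` in the sample response is made up for the demonstration.

```python
from urllib.parse import unquote, urlencode, urlparse


def title_from_url(url):
    """Extract the article title from a /wiki/ URL (illustrative helper)."""
    path = urlparse(url).path
    prefix = "/wiki/"
    if not path.startswith(prefix):
        raise ValueError(f"not a /wiki/ article URL: {url}")
    # Titles use underscores in URLs and spaces in the API.
    return unquote(path[len(prefix):]).replace("_", " ")


def page_id_query_url(url):
    """Build the API request that would return the page ID for this article."""
    host = urlparse(url).netloc
    params = {"action": "query", "titles": title_from_url(url), "format": "json"}
    return f"https://{host}/w/api.php?{urlencode(params)}"


def page_ids_from_response(data):
    """Collect page IDs from a decoded JSON response; missing pages have none."""
    pages = data.get("query", {}).get("pages", {})
    return [page["pageid"] for page in pages.values() if "pageid" in page]


print(page_id_query_url("http://en.wikipedia.org/wiki/Waterfall"))

# Made-up response shape for illustration; 123 is not the real page ID.
fake_response = {"query": {"pages": {"123": {"pageid": 123, "title": "Waterfall"}}}}
print(page_ids_from_response(fake_response))
```

Several titles can be joined with `|` in a single `titles=` parameter, which keeps the number of requests down when many links need resolving.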