Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Automatically identify and categorize place names #49

Open
JonathanReeve opened this issue Dec 7, 2018 · 10 comments
Open

Automatically identify and categorize place names #49

JonathanReeve opened this issue Dec 7, 2018 · 10 comments

Comments

@JonathanReeve
Copy link
Member

  1. Do some NER to find place names (SpaCy? Entity type: place) Here is some documentation from SpaCy
  2. Automatically look up that place name in DBPedia, using the DBPedia API and the Python requests library.
  1. Use that data to include latitude, longitude, description, etc. for each (real) place.
  2. Make a heuristic for categorizing place names.
@JonathanReeve
Copy link
Member Author

@matthewkumar, want to give this a shot?

@JonathanReeve
Copy link
Member Author

This is very closely related to #27, and is partially complete in #41.

@JonathanReeve
Copy link
Member Author

JonathanReeve commented Dec 7, 2018

See also open-editions/corpus-joyce-portrait-TEI#90. It's best to have place name metadata in a separate file, say, places.xml, and each place name in the text is referenced by an xml id. Here's some TEI documentation

@JonathanReeve
Copy link
Member Author

We should also integrate @muziejus's Wandering Rocks data: https://github.com/muziejus/wandering-rocks/blob/master/data/instances.csv

@workshub
Copy link

workshub bot commented Dec 27, 2021

A user started working on this issue via WorksHub.

@workshub
Copy link

workshub bot commented Jan 1, 2022

@Freitas-Mp started working on this issue via WorksHub.

@JonathanReeve
Copy link
Member Author

Hi @Freitas-Mp! Could you tell me what you had in mind for this issue? I may be able to help you think through it.

@workshub
Copy link

workshub bot commented Jan 17, 2022

@baoduong started working on this issue via WorksHub.

@workshub
Copy link

workshub bot commented Jan 17, 2022

A user started working on this issue via WorksHub.

@workshub
Copy link

workshub bot commented Jan 22, 2022

@bragaji started working on this issue via WorksHub.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant