I'm looking for Arabic text annotated with part-of-speech tags, IOB/chunk tags, and/or parse trees: anything that could be used with NLTK for training Arabic taggers and parsers.

asked 04 Apr '11, 00:56

japerk


Check the Arabic Treebanks in the Linguistic Data Consortium (LDC) catalogue, e.g., http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2010T13. The licensing fee depends on your type of affiliation.
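
Once you have treebank files, a rough sketch of turning them into tagger training data might look like the following. It assumes the parses are in Penn Treebank-style bracketed form and that empty elements are tagged -NONE-; the sentence, its tags, and the helper name `tagged_words` are made up for illustration and are not taken from any LDC release.

```python
import re


def tagged_words(bracketed):
    """Return (word, tag) pairs from one Penn Treebank-style bracketed parse."""
    # A leaf looks like "(TAG word)": a tag and a token with no nested brackets.
    leaves = re.findall(r"\(([^\s()]+)\s+([^\s()]+)\)", bracketed)
    # Skip empty elements such as traces (assumed here to be tagged -NONE-).
    return [(word, tag) for tag, word in leaves if tag != "-NONE-"]


# Hypothetical transliterated sentence with invented tags, for illustration only.
example = "(S (VP (PV ktb) (NP (DET+NOUN Alwld)) (NP (DET+NOUN AlrsAlp))))"

# One inner list per sentence, each holding (word, tag) tuples.
train_sents = [tagged_words(example)]
print(train_sents)
```

If NLTK is installed, `nltk.Tree.fromstring(example).pos()` should yield the same kind of pairs (it does not filter empty elements), and a list shaped like `train_sents` is what `nltk.UnigramTagger(train_sents)` expects for training.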

Cheers,

Fredrik

answered 04 Apr '11, 13:46

fredriko

edited 09 Apr '11, 11:42

rgrp ♦♦

Last updated: 12 Jul, 08:10
