machine learning - How to use python to tokenize and chunk tagged sentences line by line -

April 15, 2015

i'm linguistist , want use python tokenize sentences in csv document line line , tell tag , token position in tag(b-beginning or i-inside) example below.

"id", "sentence" "1", "<person>claire</person>lived in<location>london uk</location>for<time>2 years</time>" "2", "<location>uk</location> in<location>europe</location>"  ...........  ...........    dataframe = pd.read_csv(document)  sentences = dataframe['sentence']  line in sentences :      #print token position tag   >> claire  b-per  person      lived   null   null        in      null   null     london  b-loc  location     uk      i-loc  location         null   null     2       b-tim   time     years   i-tim   time       uk      b-loc  location          null   null                     in      null   null     europe  b-loc  location

Search This Blog

Silver

machine learning - How to use python to tokenize and chunk tagged sentences line by line -

Comments

Post a Comment

Popular posts from this blog

user interface - How to replace the Python logo in a Tkinter-based Python GUI app? -

android - Get AccessToken using signpost OAuth without opening a browser (Two legged Oauth) -

org.mockito.exceptions.misusing.InvalidUseOfMatchersException: mockito -