|
getTriggers(corpus)
Returns a dictionary of "entity type"->"entity text"->"count" |
|
|
|
getDistribution(trigDict)
Converts a dictionary of "entity type"->"entity text"->"count" to "entity text"->"entity type"->"(count, fraction)" |
|
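The conversion described for getDistribution can be illustrated with a minimal sketch. The function name `get_distribution_sketch` is hypothetical, and the sketch assumes that "fraction" means the share of an entity text's total occurrences that falls under a given entity type; the source does not define it, so the actual implementation may differ.

```python
from collections import defaultdict


def get_distribution_sketch(trig_dict):
    """Invert type -> text -> count into text -> type -> (count, fraction).

    Assumption: fraction = count / total count of that text over all types.
    """
    # First pass: total occurrences of each entity text across all types
    totals = defaultdict(int)
    for entity_type, texts in trig_dict.items():
        for text, count in texts.items():
            totals[text] += count

    # Second pass: build the inverted dictionary with (count, fraction) pairs
    distribution = {}
    for entity_type, texts in trig_dict.items():
        for text, count in texts.items():
            total = totals[text]
            fraction = count / total if total else 0.0
            distribution.setdefault(text, {})[entity_type] = (count, fraction)
    return distribution


# Hypothetical input in the shape that getTriggers is documented to return
example = {
    "Binding": {"binds": 3, "interacts": 1},
    "Regulation": {"interacts": 3},
}
result = get_distribution_sketch(example)
print(result["interacts"])
```

Under this assumption, a text that appears with only one entity type gets a fraction of 1.0, while an ambiguous text has its fractions split between the types.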
|
|
findHeads(corpus, stringsFrom, methods, parse, tokenization) |
|
|
|
mapSplits(splits, string, stringOffset)
Maps substrings to a string, and stems them |
|
|
|
findHeadsDictionary(corpus, stringsFrom, parse, tokenization) |
|
|
|
getEntityHeadToken(entity, tokens, tokenHeadScores) |
|
|
|
findHeadToken(candidateTokens, tokenHeadScores)
Selects the candidate token that is closest to the root of the dependency parse subtree to which the candidate tokens belong. |
|
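The selection that findHeadToken describes might look roughly like the sketch below. It assumes that `tokenHeadScores` is a mapping from token to a numeric score in which a higher score means closer to the root, and that ties go to the first candidate; neither detail is stated in the source, and the name `find_head_token_sketch` is hypothetical.

```python
def find_head_token_sketch(candidate_tokens, token_head_scores):
    """Return the candidate with the highest head score, or None if empty.

    Assumptions: a higher score means closer to the parse root, and
    ties are broken in favour of the earliest candidate in the list.
    """
    if not candidate_tokens:
        return None
    # max() returns the first maximal element, which gives the tie-breaking
    return max(candidate_tokens, key=lambda token: token_head_scores[token])


# Hypothetical scores for the tokens of "protein binds the receptor"
head_scores = {"protein": 1, "binds": 2, "the": 0, "receptor": 1}
head = find_head_token_sketch(["the", "receptor"], head_scores)
print(head)
```

For a multi-token entity such as "the receptor", this picks "receptor" because its score places it nearer the root than the determiner.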
|
|
getTokenHeadScores(tokens, dependencies, sentenceId=None)
A head token is chosen using a heuristic that prefers tokens closer
to the root of the dependency parse. |
|
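One way to realise a heuristic that prefers tokens closer to the root is to score each token by its negated depth in the dependency graph. The sketch below is an illustration of that idea only, not the scoring that getTokenHeadScores actually performs: the token and dependency representations (plain strings and `(governor, dependent)` pairs) and the name `token_head_scores_sketch` are assumptions, and the `sentenceId` parameter is left out.

```python
from collections import deque


def token_head_scores_sketch(tokens, dependencies):
    """Score tokens so that tokens nearer a parse root score higher.

    tokens: list of hashable token ids (hypothetical representation)
    dependencies: list of (governor, dependent) pairs (hypothetical)
    """
    children = {token: [] for token in tokens}
    has_governor = set()
    for governor, dependent in dependencies:
        children.setdefault(governor, []).append(dependent)
        has_governor.add(dependent)

    # Roots are the tokens that no other token governs
    roots = [token for token in tokens if token not in has_governor]
    depth = {root: 0 for root in roots}

    # Breadth-first traversal assigns each token its shortest distance to a root
    queue = deque(roots)
    while queue:
        token = queue.popleft()
        for child in children.get(token, []):
            if child not in depth:  # visited check also guards against cycles
                depth[child] = depth[token] + 1
                queue.append(child)

    # Negate the depth so that a higher score means closer to the root;
    # tokens never reached from a root receive the lowest possible score
    worst = len(tokens)
    return {token: -depth.get(token, worst) for token in tokens}


tokens = ["protein", "binds", "the", "receptor"]
dependencies = [("binds", "protein"), ("binds", "receptor"), ("receptor", "the")]
scores = token_head_scores_sketch(tokens, dependencies)
print(scores)
```

With scores of this shape, choosing the head of an entity reduces to taking the highest-scoring token among the candidates.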
|