Module ExampleUtils

Tools for writing and reading classifier example files

These functions read and write machine learning example files and convert examples into their final data forms. In memory, each example is a 4-tuple (or list) of the form (id, class, features, extra): id is a string; class is an int (-1 or +1 for binary classification); features is a dictionary of int:float pairs, where the int is the feature id and the float is the feature value; and extra is a dictionary of string:string pairs holding additional information about the example.
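As an illustration of the layout described above, the following is a minimal sketch of one example in memory. The id string, the feature ids and the contents of the extra dictionary are invented for illustration, and same_class_and_features is a hypothetical helper, not a function of this module.

```python
# One example in the (id, class, features, extra) layout.
# All concrete values here are made up for illustration.
example = (
    "doc1.x0",                 # id: a string
    1,                         # class: -1 or +1 for binary classification
    {3: 1.0, 17: 0.5},         # features: feature id (int) -> value (float)
    {"note": "illustration"},  # extra: string -> string
)

# The tuple unpacks positionally into its four parts.
example_id, example_class, features, extra = example


def same_class_and_features(a, b):
    """Hypothetical helper: True if two examples have the same class
    and identical feature vectors (ids and extra are ignored)."""
    return a[1] == b[1] and a[2] == b[2]


# Dictionary comparison ignores insertion order, so these two match.
other = ("doc1.x1", 1, {17: 0.5, 3: 1.0}, {})
# Same features but a different class, so this one does not match.
negative = ("doc1.x2", -1, {3: 1.0, 17: 0.5}, {})
```

Because the features are stored as a dictionary, only non-zero features need to be listed, which suits sparse feature vectors.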

Functions

gen2iterable(genfunc)
Makes a multi-use iterator generator.

isDuplicate(example1, example2)
 
removeDuplicates(examples)
Removes all but one of the examples that have the same class and identical feature vectors.
 
normalizeFeatureVectors(examples)

copyExamples(examples)

appendExamples(examples, file)

appendExamplesBinary(examples, file)

writeExamples(examples, filename, commentLines=None)

writePredictions(predictions, exampleFileName)

getIdsFromFile(filename)

readExamples(*args, **kwargs)

makeCorpusDivision(corpusElements, fraction=0.5, seed=0)

makeCorpusFolds(corpusElements, folds=10)

makeExampleDivision(examples, fraction=0.5)

makeExampleFolds(examples, folds=10)

makeDivision(ids, fraction=0.5, seed=0)

makeFolds(ids, folds=10)

divideExamples(examples, division=None)

divideExampleFile(exampleFileName, division, outputDir)

loadPredictions(*args, **kwargs)

Variables
  __package__ = 'TEES.Core'

Function Details

gen2iterable(genfunc)

Makes a multi-use iterator generator. See http://bugs.python.org/issue5973 for details.
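
The following is a minimal sketch of the usual pattern for such a decorator, not necessarily this module's source. A plain generator is exhausted after one pass; wrapping the generator function so that each call returns an object whose __iter__ starts a fresh generator makes the result reusable. The names gen2iterable_sketch and count_to are hypothetical.

```python
import functools


def gen2iterable_sketch(genfunc):
    """Hypothetical stand-in: turn a generator function into a function
    that returns a re-iterable object."""
    @functools.wraps(genfunc)
    def wrapper(*args, **kwargs):
        class _Iterable(object):
            def __iter__(self):
                # Each iteration calls the generator function again,
                # so every pass starts from the beginning.
                return genfunc(*args, **kwargs)
        return _Iterable()
    return wrapper


@gen2iterable_sketch
def count_to(n):
    """Made-up generator used only to demonstrate the decorator."""
    for i in range(1, n + 1):
        yield i


numbers = count_to(3)
first_pass = list(numbers)
second_pass = list(numbers)  # not empty: iteration restarted
```

With an undecorated generator the second pass would yield nothing. The trade-off is that the generator body runs again on every pass, so a decorated file reader would re-read its input each time.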

readExamples(*args, **kwargs)

Decorators:
  • @gen2iterable

loadPredictions(*args, **kwargs)

Decorators:
  • @gen2iterable