Object

org.tresamigos.smv.matcher_old

StringMetricUDFs

Related Doc: package matcher_old

object StringMetricUDFs

StringMetricUDFs is a collection of string similarity measures, implemented using the Scala StringMetrics library.

UDFs with Boolean returns

- soundexMatch: true if the Soundex codes of the two strings match exactly

UDFs with Float returns

N-gram based measures

- nGram2: 2-gram similarity with formula (number of overlapping grams)/max(s1.gramCnt, s2.gramCnt)
- nGram3: 3-gram similarity with the same formula as nGram2
- diceSorensen: 2-gram similarity with formula (2 * number of overlapping grams)/(s1.gramCnt + s2.gramCnt)
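The two n-gram formulas can be illustrated with a small plain-Scala sketch. This is not the library's implementation: the object name `NGramSketch`, its helper functions, and the handling of strings shorter than n (scored as 0) are assumptions made for illustration only.

```scala
// Illustrative sketch only; names and edge-case handling are assumptions,
// not the StringMetricUDFs implementation.
object NGramSketch {
  // All contiguous n-character substrings of s (empty if s is shorter than n).
  def grams(s: String, n: Int): Seq[String] =
    if (s.length < n) Seq.empty else s.sliding(n).toSeq

  // Number of grams shared by both sequences (multiset intersection).
  def overlap(a: Seq[String], b: Seq[String]): Int = a.intersect(b).size

  // overlap / max(s1.gramCnt, s2.gramCnt), as in nGram2 (n = 2) and nGram3 (n = 3).
  def nGramSimilarity(s1: String, s2: String, n: Int): Float = {
    val g1 = grams(s1, n)
    val g2 = grams(s2, n)
    val denom = math.max(g1.size, g2.size)
    if (denom == 0) 0f else overlap(g1, g2).toFloat / denom
  }

  // (2 * overlap) / (s1.gramCnt + s2.gramCnt) over 2-grams, as in diceSorensen.
  def diceSorensen(s1: String, s2: String): Float = {
    val g1 = grams(s1, 2)
    val g2 = grams(s2, 2)
    val denom = g1.size + g2.size
    if (denom == 0) 0f else 2f * overlap(g1, g2) / denom
  }
}
```

For example, "night" and "nacht" each have four 2-grams and share only "ht", so both formulas give 1/4. For "abcd" and "abc" (three and two 2-grams, two shared), the max-based formula gives 2/3 while Dice-Sorensen gives 4/5, which shows how the two measures diverge when the strings differ in length.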

Edit distance measures

- levenshtein
- jaroWinkler
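The levenshtein UDF is documented as returning a value between 0 and 1 rather than a raw edit count, but this page does not say how the distance is normalized. The sketch below assumes one common choice, 1 - distance / max(length); the object name `EditDistanceSketch`, the normalization, and the treatment of two empty strings are all assumptions, not the library's confirmed behavior.

```scala
// Illustrative sketch only; the normalization is an assumption,
// not the documented behavior of StringMetricUDFs.levenshtein.
object EditDistanceSketch {
  // Classic dynamic-programming Levenshtein distance, keeping one previous row.
  def distance(a: String, b: String): Int = {
    var prev = Array.range(0, b.length + 1)
    for (i <- 1 to a.length) {
      val curr = new Array[Int](b.length + 1)
      curr(0) = i
      for (j <- 1 to b.length) {
        val cost = if (a(i - 1) == b(j - 1)) 0 else 1
        // Minimum over insertion, deletion, and substitution.
        curr(j) = math.min(math.min(curr(j - 1) + 1, prev(j) + 1), prev(j - 1) + cost)
      }
      prev = curr
    }
    prev(b.length)
  }

  // ASSUMPTION: similarity = 1 - distance / max(length); two empty strings score 1.
  def similarity(a: String, b: String): Float = {
    val maxLen = math.max(a.length, b.length)
    if (maxLen == 0) 1f else 1f - distance(a, b).toFloat / maxLen
  }
}
```

Under this assumed normalization, identical strings score 1 and strings with no characters in common at any position, such as "abc" and "xyz", score 0, which is consistent with the "0 is no match, 1 is full match" convention used by the Float-returning UDFs.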

Linear Supertypes
AnyRef, Any

Value Members

  1. final def !=(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  5. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  6. val diceSorensen: UserDefinedFunction

    UDF returning a Float: 0 means no match, 1 means a full match.

  7. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  8. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  9. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  10. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  11. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  12. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  13. val jaroWinkler: UserDefinedFunction

    UDF returning a Float: 0 means no match, 1 means a full match.

  14. val levenshtein: UserDefinedFunction

    UDF returning a Float: 0 means no match, 1 means a full match.

  15. val nGram2: UserDefinedFunction

    UDF returning a Float: 0 means no match, 1 means a full match.

  16. val nGram3: UserDefinedFunction

    UDF returning a Float: 0 means no match, 1 means a full match.

  17. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  18. final def notify(): Unit

    Definition Classes
    AnyRef
  19. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  20. val soundexMatch: UserDefinedFunction

    UDF returning a Boolean: true if the Soundex codes of the two strings match exactly.

  21. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  22. def toString(): String

    Definition Classes
    AnyRef → Any
  23. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  24. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  25. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
