ALS

Companion class ALS

object ALS extends DefaultParamsReadable[ALS] with Logging with Serializable

:: DeveloperApi :: An implementation of ALS that supports generic ID types, specialized for Int and Long. This is exposed as a developer API for users who do need other ID types. But it is not recommended because it increases the shuffle size and memory requirement during training. For simplicity, users and items must have the same type. The number of distinct users/items should be smaller than 2 billion.

Annotations: @DeveloperApi()

Linear Supertypes

Serializable, Serializable, Logging, DefaultParamsReadable[ALS], MLReadable[ALS], AnyRef, Any

Ordering

Alphabetic
By Inheritance

Inherited

ALS
Serializable
Serializable
Logging
DefaultParamsReadable
MLReadable
AnyRef
Any

Hide All
Show All

Visibility

Public
All

Type Members

case class Rating[ID](user: ID, item: ID, rating: Float) extends Product with Serializable
:: DeveloperApi :: Rating class for better code readability.
:: DeveloperApi :: Rating class for better code readability.

Annotations
@DeveloperApi()

Value Members

final def !=(arg0: Any): Boolean

Definition Classes
AnyRef → Any
final def ##(): Int

Definition Classes
AnyRef → Any
final def ==(arg0: Any): Boolean

Definition Classes
AnyRef → Any
final def asInstanceOf[T0]: T0

Definition Classes
Any
def clone(): AnyRef

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@native() @throws( ... )
final def eq(arg0: AnyRef): Boolean

Definition Classes
AnyRef
def equals(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def finalize(): Unit

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( classOf[java.lang.Throwable] )
final def getClass(): Class[_]

Definition Classes
AnyRef → Any
Annotations
@native()
def hashCode(): Int

Definition Classes
AnyRef → Any
Annotations
@native()
def initializeLogIfNecessary(isInterpreter: Boolean, silent: Boolean = false): Boolean

Attributes
protected
Definition Classes
Logging
def initializeLogIfNecessary(isInterpreter: Boolean): Unit

Attributes
protected
Definition Classes
Logging
final def isInstanceOf[T0]: Boolean

Definition Classes
Any
def isTraceEnabled(): Boolean

Attributes
protected
Definition Classes
Logging
def load(path: String): ALS
Reads an ML instance from the input path, a shortcut of read.load(path).
Reads an ML instance from the input path, a shortcut of read.load(path).

Definition Classes
ALS → MLReadable
Annotations
@Since( "1.6.0" )
Note
Implementing classes should override this to be Java-friendly.
def log: Logger

Attributes
protected
Definition Classes
Logging
def logDebug(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logDebug(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
def logError(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logError(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
def logInfo(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logInfo(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
def logName: String

Attributes
protected
Definition Classes
Logging
def logTrace(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logTrace(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
def logWarning(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logWarning(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
final def ne(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def notify(): Unit

Definition Classes
AnyRef
Annotations
@native()
final def notifyAll(): Unit

Definition Classes
AnyRef
Annotations
@native()
def read: MLReader[ALS]
Returns an MLReader instance for this class.
Returns an MLReader instance for this class.

Definition Classes
DefaultParamsReadable → MLReadable
final def synchronized[T0](arg0: ⇒ T0): T0

Definition Classes
AnyRef
def toString(): String

Definition Classes
AnyRef → Any
def train[ID](ratings: RDD[Rating[ID]], rank: Int = 10, numUserBlocks: Int = 10, numItemBlocks: Int = 10, maxIter: Int = 10, regParam: Double = 0.1, implicitPrefs: Boolean = false, alpha: Double = 1.0, nonnegative: Boolean = false, intermediateRDDStorageLevel: StorageLevel = StorageLevel.MEMORY_AND_DISK, finalRDDStorageLevel: StorageLevel = StorageLevel.MEMORY_AND_DISK, checkpointInterval: Int = 10, seed: Long = 0L)(implicit arg0: ClassTag[ID], ord: Ordering[ID]): (RDD[(ID, Array[Float])], RDD[(ID, Array[Float])])
:: DeveloperApi :: Implementation of the ALS algorithm.
:: DeveloperApi :: Implementation of the ALS algorithm.
This implementation of the ALS factorization algorithm partitions the two sets of factors among Spark workers so as to reduce network communication by only sending one copy of each factor vector to each Spark worker on each iteration, and only if needed. This is achieved by precomputing some information about the ratings matrix to determine which users require which item factors and vice versa. See the Scaladoc for InBlock for a detailed explanation of how the precomputation is done.
In addition, since each iteration of calculating the factor matrices depends on the known ratings, which are spread across Spark partitions, a naive implementation would incur significant network communication overhead between Spark workers, as the ratings RDD would be repeatedly shuffled during each iteration. This implementation reduces that overhead by performing the shuffling operation up front, precomputing each partition's ratings dependencies and duplicating those values to the appropriate workers before starting iterations to solve for the factor matrices. See the Scaladoc for OutBlock for a detailed explanation of how the precomputation is done.
Note that the term "rating block" is a bit of a misnomer, as the ratings are not partitioned by contiguous blocks from the ratings matrix but by a hash function on the rating's location in the matrix. If it helps you to visualize the partitions, it is easier to think of the term "block" as referring to a subset of an RDD containing the ratings rather than a contiguous submatrix of the ratings matrix.

Annotations
@DeveloperApi()
final def wait(): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long, arg1: Int): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long): Unit

Definition Classes
AnyRef
Annotations
@native() @throws( ... )

Packages

ALS

Companion class ALS

object ALS extends DefaultParamsReadable[ALS] with Logging with Serializable

Type Members

Value Members

Inherited from Serializable

Inherited from Serializable

Inherited from Logging

Inherited from DefaultParamsReadable[ALS]

Inherited from MLReadable[ALS]

Inherited from AnyRef

Inherited from Any

Members

Packages

ALS 

Companion class ALS

object ALS extends DefaultParamsReadable[ALS] with Logging with Serializable

Type Members

Value Members

Inherited from Serializable

Inherited from Serializable

Inherited from Logging

Inherited from DefaultParamsReadable[ALS]

Inherited from MLReadable[ALS]

Inherited from AnyRef

Inherited from Any

Members

ALS