msmbuilder.featurizer.ContactFeaturizer(contacts='all', scheme='closest-heavy', ignore_nonprotein=True)
Featurizer based on residue-residue distances.
This featurizer transforms a dataset containing MD trajectories into a vector dataset by representing each frame in each of the MD trajectories by a vector of the distances between pairs of amino-acid residues.
The exact method for computing the distance between two residues is configurable with the scheme parameter.
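As a rough illustration of what such a feature vector contains, the sketch below computes one distance per residue pair per frame with plain NumPy. It assumes that a scheme like 'closest-heavy' means the smallest distance between any two heavy atoms of the two residues. The function name `closest_distance_features` and the inputs `residue_atoms` and `pairs` are hypothetical and are not part of msmbuilder; the heavy-atom indices are assumed to be known in advance.

```python
import numpy as np


def closest_distance_features(xyz, residue_atoms, pairs):
    """Hypothetical helper, not msmbuilder's implementation.

    xyz           : array of shape (n_frames, n_atoms, 3)
    residue_atoms : list of index arrays, one per residue (heavy atoms only)
    pairs         : list of (i, j) residue index pairs
    """
    n_frames = xyz.shape[0]
    features = np.empty((n_frames, len(pairs)))
    for k, (i, j) in enumerate(pairs):
        a = xyz[:, residue_atoms[i]]  # (n_frames, n_i, 3)
        b = xyz[:, residue_atoms[j]]  # (n_frames, n_j, 3)
        # All atom-atom distances between the two residues, per frame.
        d = np.linalg.norm(a[:, :, None, :] - b[:, None, :, :], axis=-1)
        # Keep the closest atom pair in each frame.
        features[:, k] = d.min(axis=(1, 2))
    return features


# Toy system: 2 frames, 4 atoms, two residues with two atoms each.
xyz = np.array([
    [[0, 0, 0], [1, 0, 0], [3, 0, 0], [5, 0, 0]],
    [[0, 0, 0], [1, 0, 0], [1, 1, 0], [9, 0, 0]],
], dtype=float)
residue_atoms = [np.array([0, 1]), np.array([2, 3])]
features = closest_distance_features(xyz, residue_atoms, pairs=[(0, 1)])
```

Each row of `features` corresponds to one frame and each column to one residue pair, which matches the (n_samples, n_features) layout described for partial_transform.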
Parameters: |
|
---|
__init__(contacts='all', scheme='closest-heavy', ignore_nonprotein=True)
Methods
__init__ ([contacts, scheme, ignore_nonprotein]) |
|
describe_features (traj) |
Return a list of dictionaries describing the contact features. |
featurize (traj) |
|
fit (traj_list[, y]) |
|
fit_transform (X[, y]) |
Fit to data, then transform it. |
get_params ([deep]) |
Get parameters for this estimator. |
partial_transform (traj) |
Featurize an MD trajectory into a vector space derived from residue-residue distances. |
set_params (\*\*params) |
Set the parameters of this estimator. |
summarize () |
Return some diagnostic summary statistics about this featurizer. |
transform (traj_list[, y]) |
Featurize several trajectories. |
describe_features(traj)
Return a list of dictionaries describing the contact features.
Parameters: | traj (mdtraj.Trajectory) – The trajectory to describe |
---|---|
Returns: | feature_descs – Dictionaries describing each feature, with the following information about the atoms participating in each contact |
Return type: | list of dict |
fit_transform(X, y=None, **fit_params)
Fit to data, then transform it.
Fits transformer to X and y with optional parameters fit_params and returns a transformed version of X.
Parameters: |
|
---|---|
Returns: | X_new – Transformed array. |
Return type: | numpy array of shape [n_samples, n_features_new] |
get_params(deep=True)
Get parameters for this estimator.
Parameters: | deep (boolean, optional) – If True, will return the parameters for this estimator and contained subobjects that are estimators. |
---|---|
Returns: | params – Parameter names mapped to their values. |
Return type: | mapping of string to any |
partial_transform(traj)
Featurize an MD trajectory into a vector space derived from residue-residue distances.
Parameters: | traj (mdtraj.Trajectory) – A molecular dynamics trajectory to featurize. |
---|---|
Returns: | features – A featurized trajectory is a 2D array of shape (length_of_trajectory x n_features) where each features[i] vector is computed by applying the featurization function to the `i`th snapshot of the input trajectory. |
Return type: | np.ndarray, dtype=float, shape=(n_samples, n_features) |
See also
transform()
set_params(**params)
Set the parameters of this estimator.
The method works on simple estimators as well as on nested objects (such as pipelines). The latter have parameters of the form <component>__<parameter> so that it’s possible to update each component of a nested object.
Returns: | |
---|---|
Return type: | self |
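The `<component>__<parameter>` convention can be illustrated with a small pure-Python sketch. The classes `ToyEstimator` and `ToyPipeline` are hypothetical stand-ins written for this example; they mimic the routing idea only and are not the scikit-learn or msmbuilder implementation. The value 'ca' is used purely as an illustrative setting.

```python
class ToyEstimator:
    """Hypothetical stand-in for a simple estimator."""

    def __init__(self, scheme="closest-heavy"):
        self.scheme = scheme

    def set_params(self, **params):
        for name, value in params.items():
            setattr(self, name, value)
        return self  # returning self allows call chaining


class ToyPipeline:
    """Hypothetical container that routes '<component>__<parameter>' keys."""

    def __init__(self, **steps):
        self.steps = steps

    def set_params(self, **params):
        for key, value in params.items():
            # Split on the first double underscore only, so the remainder
            # is passed on unchanged to the named component.
            component, _, parameter = key.partition("__")
            self.steps[component].set_params(**{parameter: value})
        return self


pipe = ToyPipeline(featurizer=ToyEstimator())
result = pipe.set_params(featurizer__scheme="ca")
```

After the call, the nested component's `scheme` attribute holds the new value, and `result` is the pipeline itself, consistent with the `self` return type listed above.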
summarize()
Return some diagnostic summary statistics about this featurizer.
transform(traj_list, y=None)
Featurize several trajectories.
Parameters: | traj_list (list(mdtraj.Trajectory)) – Trajectories to be featurized. |
---|---|
Returns: | features – The featurized trajectories. features[i] is the featurized version of traj_list[i] and has shape (n_samples_i, n_features) |
Return type: | list(np.ndarray), length = len(traj_list) |
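The relationship between the per-trajectory and multi-trajectory methods can be sketched as follows, assuming that transform applies the per-trajectory featurization to each element of the list independently. The functions below are hypothetical stand-ins that operate on raw coordinate arrays rather than mdtraj.Trajectory objects, and the single feature they compute (the distance between atoms 0 and 1) is chosen only to keep the example small.

```python
import numpy as np


def toy_partial_transform(xyz):
    """Hypothetical stand-in: one feature per frame.

    xyz has shape (n_frames, n_atoms, 3); the result has shape (n_frames, 1).
    """
    return np.linalg.norm(xyz[:, 0] - xyz[:, 1], axis=-1)[:, None]


def toy_transform(traj_list):
    """Featurize each trajectory on its own and return a list of arrays."""
    return [toy_partial_transform(xyz) for xyz in traj_list]


# Two toy trajectories of different lengths (5 and 8 frames, 2 atoms each).
rng = np.random.default_rng(0)
traj_list = [rng.normal(size=(5, 2, 3)), rng.normal(size=(8, 2, 3))]
features = toy_transform(traj_list)
```

Because each trajectory is handled separately, the arrays in `features` may have different numbers of rows but always share the same number of columns.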