profile

The Profile object is used for the search of the gene with Hmmer. A “Profile” must match a HMM protein profile file, which name is based on the profile name. For instance, the gspG gene has the corresponding “gspG.hmm” profile file provided at a dedicated location.

profile API reference

ProfileFactory

class macsypy.profile.ProfileFactory(cfg)[source]

Build and store all Profile objects. Profiles must not be instanciated directly. The profile_factory must be used. The profile_factory ensures there is only one instance of profile for a given name. To get a profile, use the method get_profile. If the profile is already cached, this instance is returned. Otherwise a new profile is built, stored in the profile_factory and then returned.

__init__(cfg)[source]

Initialize self. See help(type(self)) for accurate signature.

__weakref__

list of weak references to the object (if defined)

get_profile(gene, model_location)[source]
Parameters:
  • gene (macsypy.gene.Gene or macsypy.gene.Homolog or macsypy.gene.Analog object) – the gene associated to this profile
  • model_location (macsypy.registries.ModelLocation object.) – The where to get the profile
Returns:

the profile corresponding to the name. If the profile already exists, return it. Otherwise build it, store it and return it.

Return type:

macsypy.profile.Profile object

Profile

class macsypy.profile.Profile(gene, cfg, path)[source]

Handle a HMM protein profile

__init__(gene, cfg, path)[source]
Parameters:
  • gene (macsypy.secretion.Gene object) – the gene corresponding to this profile
  • cfg (macsypy.config.Config object) – the configuration
  • path (string) – the path to the hmm profile.
__len__()[source]
Returns:the length of the HMM protein profile
Return type:int
__str__()[source]

Print the name of the corresponding gene and the path to the HMM profile.

__weakref__

list of weak references to the object (if defined)

_profile_features()[source]

Parse the HMM profile to extract the length and the presence of GA bit threshold

Returns:the length, presence of ga bit threshold
Return type:tuple(int length, bool ga_threshold)
execute(cpu=1)[source]

Launch the Hmmer search (hmmsearch executable) with this profile

Parameters:cpu (int) – the number of cpu to use for hmmsearch (must be >= 1)
Returns:an object storing information on the results of the HMM search (HMMReport)
Return type:macsypy.report.HMMReport object