Configuration API¶
The configuration system provides type-safe configuration classes for data generation.
Configuration Schemas¶
Molecular Configuration¶
synthbiodata.config.schema.v1.molecular.MolecularConfig
¶
Bases: BaseConfig
Configuration schema for molecular descriptor data.
This class defines the configuration options for generating synthetic molecular descriptor data, including ranges and distributions for molecular weight (MW), LogP, and TPSA, as well as target protein family probabilities.
Parameters¶
mw_mean : float Mean molecular weight of generated molecules. mw_std : float Standard deviation of molecular weight. mw_min : float Minimum allowed molecular weight. mw_max : float Maximum allowed molecular weight. logp_mean : float Mean LogP (octanol-water partition coefficient) value. logp_std : float Standard deviation of LogP. logp_min : float Minimum allowed LogP value. logp_max : float Maximum allowed LogP value. tpsa_mean : float Mean topological polar surface area (TPSA) value. tpsa_std : float Standard deviation of TPSA. tpsa_min : float Minimum allowed TPSA value. tpsa_max : float Maximum allowed TPSA value. target_families : list of str List of target protein families to sample from. target_family_probs : list of float Probability distribution for selecting each target family.
Examples¶
config = MolecularConfig() config.mw_mean 350.0 config.target_families ['GPCR', 'Kinase', 'Protease', 'Ion Channel', 'Nuclear Receptor']
Source code in src/synthbiodata/config/schema/v1/molecular.py
12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 |
|
Functions¶
validate_ranges() -> MolecularConfig
¶
Validate molecular descriptor ranges and standard deviations.
Source code in src/synthbiodata/config/schema/v1/molecular.py
ADME Configuration¶
synthbiodata.config.schema.v1.adme.ADMEConfig
¶
Bases: BaseConfig
Configuration for ADME data generation.
Source code in src/synthbiodata/config/schema/v1/adme.py
Functions¶
validate_parameters() -> ADMEConfig
¶
Validate ADME parameters and standard deviations.
Source code in src/synthbiodata/config/schema/v1/adme.py
Base Configuration¶
synthbiodata.config.base.BaseConfig
¶
Bases: BaseModel
Base configuration for all data types.
Source code in src/synthbiodata/config/base.py
Functions¶
validate_splits() -> BaseConfig
¶
Validate dataset split ratios.