| Copyright | (c) 2013-2023 Brendan Hay | 
|---|---|
| License | Mozilla Public License, v. 2.0. | 
| Maintainer | Brendan Hay | 
| Stability | auto-generated | 
| Portability | non-portable (GHC extensions) | 
| Safe Haskell | Safe-Inferred | 
| Language | Haskell2010 | 
Amazonka.Glue.Types.FindMatchesParameters
Description
Synopsis
- data FindMatchesParameters = FindMatchesParameters' {}
- newFindMatchesParameters :: FindMatchesParameters
- findMatchesParameters_accuracyCostTradeoff :: Lens' FindMatchesParameters (Maybe Double)
- findMatchesParameters_enforceProvidedLabels :: Lens' FindMatchesParameters (Maybe Bool)
- findMatchesParameters_precisionRecallTradeoff :: Lens' FindMatchesParameters (Maybe Double)
- findMatchesParameters_primaryKeyColumnName :: Lens' FindMatchesParameters (Maybe Text)
Documentation
data FindMatchesParameters Source #
The parameters to configure the find matches transform.
See: newFindMatchesParameters smart constructor.
Constructors
| FindMatchesParameters' | |
| Fields 
 | |
Instances
newFindMatchesParameters :: FindMatchesParameters Source #
Create a value of FindMatchesParameters with all optional fields omitted.
Use generic-lens or optics to modify other optional fields.
The following record fields are available, with the corresponding lenses provided for backwards compatibility:
$sel:accuracyCostTradeoff:FindMatchesParameters', findMatchesParameters_accuracyCostTradeoff - The value that is selected when tuning your transform for a balance
 between accuracy and cost. A value of 0.5 means that the system balances
 accuracy and cost concerns. A value of 1.0 means a bias purely for
 accuracy, which typically results in a higher cost, sometimes
 substantially higher. A value of 0.0 means a bias purely for cost, which
 results in a less accurate FindMatches transform, sometimes with
 unacceptable accuracy.
Accuracy measures how well the transform finds true positives and true negatives. Increasing accuracy requires more machine resources and cost. But it also results in increased recall.
Cost measures how many compute resources, and thus money, are consumed to run the transform.
$sel:enforceProvidedLabels:FindMatchesParameters', findMatchesParameters_enforceProvidedLabels - The value to switch on or off to force the output to match the provided
 labels from users. If the value is True, the find matches transform
 forces the output to match the provided labels. The results override the
 normal conflation results. If the value is False, the find matches
 transform does not ensure all the labels provided are respected, and the
 results rely on the trained model.
Note that setting this value to true may increase the conflation execution time.
$sel:precisionRecallTradeoff:FindMatchesParameters', findMatchesParameters_precisionRecallTradeoff - The value selected when tuning your transform for a balance between
 precision and recall. A value of 0.5 means no preference; a value of 1.0
 means a bias purely for precision, and a value of 0.0 means a bias for
 recall. Because this is a tradeoff, choosing values close to 1.0 means
 very low recall, and choosing values close to 0.0 results in very low
 precision.
The precision metric indicates how often your model is correct when it predicts a match.
The recall metric indicates that for an actual match, how often your model predicts the match.
$sel:primaryKeyColumnName:FindMatchesParameters', findMatchesParameters_primaryKeyColumnName - The name of a column that uniquely identifies rows in the source table.
 Used to help identify matching records.
findMatchesParameters_accuracyCostTradeoff :: Lens' FindMatchesParameters (Maybe Double) Source #
The value that is selected when tuning your transform for a balance
 between accuracy and cost. A value of 0.5 means that the system balances
 accuracy and cost concerns. A value of 1.0 means a bias purely for
 accuracy, which typically results in a higher cost, sometimes
 substantially higher. A value of 0.0 means a bias purely for cost, which
 results in a less accurate FindMatches transform, sometimes with
 unacceptable accuracy.
Accuracy measures how well the transform finds true positives and true negatives. Increasing accuracy requires more machine resources and cost. But it also results in increased recall.
Cost measures how many compute resources, and thus money, are consumed to run the transform.
findMatchesParameters_enforceProvidedLabels :: Lens' FindMatchesParameters (Maybe Bool) Source #
The value to switch on or off to force the output to match the provided
 labels from users. If the value is True, the find matches transform
 forces the output to match the provided labels. The results override the
 normal conflation results. If the value is False, the find matches
 transform does not ensure all the labels provided are respected, and the
 results rely on the trained model.
Note that setting this value to true may increase the conflation execution time.
findMatchesParameters_precisionRecallTradeoff :: Lens' FindMatchesParameters (Maybe Double) Source #
The value selected when tuning your transform for a balance between precision and recall. A value of 0.5 means no preference; a value of 1.0 means a bias purely for precision, and a value of 0.0 means a bias for recall. Because this is a tradeoff, choosing values close to 1.0 means very low recall, and choosing values close to 0.0 results in very low precision.
The precision metric indicates how often your model is correct when it predicts a match.
The recall metric indicates that for an actual match, how often your model predicts the match.
findMatchesParameters_primaryKeyColumnName :: Lens' FindMatchesParameters (Maybe Text) Source #
The name of a column that uniquely identifies rows in the source table. Used to help identify matching records.