jelli.core.experimental_correlations

`ExperimentalCorrelations`

A class to represent experimental correlations.

Parameters:

Name	Type	Description	Default
`hash_val`	`str`	A unique hash value representing the combination of measurements and observables.	required
`data_type`	`str`	The type of data stored in the instance. It can be `correlations`, `central`, or `uncertainties`.	required
`data`	`ndarray`	The data array containing the correlation matrix, central values, or uncertainties.	required
`row_names`	`Iterable[str]`	The names of the observables corresponding to the rows of the data array.	required
`col_names`	`Iterable[str]`	The names of the observables corresponding to the columns of the data array.	required

Attributes:

Name	Type	Description
`hash_val`	`str`	A unique hash value representing the combination of measurements and observables.
`data_type`	`str`	The type of data stored in the instance. It can be `correlations`, `central`, or `uncertainties`.
`data`	`ndarray`	The data array containing the correlation matrix, central values, or uncertainties.
`row_names`	`Iterable[str]`	The names of the observables corresponding to the rows of the data array.
`col_names`	`Iterable[str]`	The names of the observables corresponding to the columns of the data array.
`_instances`	`Dict[str, Dict[str, ExperimentalCorrelations]]`	A class-level dictionary to hold instances of `ExperimentalCorrelations` for each data type.
`_covariance_scaled`	`Dict[str, ndarray]`	A class-level dictionary to hold scaled covariance matrices for each unique combination of measurements and observables.
`_observable_names`	`Iterable[Iterable[str]]`	A class-level iterable to hold lists of observable names for each observable sector.

Methods:

Name	Description
`load`	Load observable names from all correlated observable sectors and initialize class-level attributes.
`compute`	Compute the correlation matrices, central values, and uncertainties from the specified measurements.
`get_data`	Retrieve the data array for the specified data type and combination of observables and measurements.
`get_cov_scaled`	Retrieve the scaled covariance matrix for the specified combination of observables and measurements.

Examples:

Load observable names from all correlated observable sectors:

>>> ExperimentalCorrelations.load()

Compute the correlation matrices, central values, and uncertainties from the specified measurements:

>>> ExperimentalCorrelations.compute(include_measurements=['measurement1', 'measurement2'])

Retrieve the data array for a specific data type and combination of observables and measurements:

>>> data = ExperimentalCorrelations.get_data(data_type='correlations', include_measurements=['measurement1'], row_names=['observable1'], col_names=['observable2'])

Retrieve the scaled covariance matrix for a specific combination of observables and measurements:

>>> cov_scaled = ExperimentalCorrelations.get_cov_scaled(include_measurements=['measurement1'], row_names=['observable1'], col_names=['observable2'], std_exp_scaled_row=np.array([0.5]), std_exp_scaled_col=np.array([1.2]))

Source code in jelli/core/experimental_correlations.py

class ExperimentalCorrelations:
    '''
    A class to represent experimental correlations.

    Parameters
    ----------
    hash_val : str
        A unique hash value representing the combination of measurements and observables.
    data_type : str
        The type of data stored in the instance. It can be `correlations`, `central`, or `uncertainties`.
    data : np.ndarray
        The data array containing the correlation matrix, central values, or uncertainties.
    row_names : Iterable[str]
        The names of the observables corresponding to the rows of the data array.
    col_names : Iterable[str]
        The names of the observables corresponding to the columns of the data array.

    Attributes
    ----------
    hash_val : str
        A unique hash value representing the combination of measurements and observables.
    data_type : str
        The type of data stored in the instance. It can be `correlations`, `central`, or `uncertainties`.
    data : np.ndarray
        The data array containing the correlation matrix, central values, or uncertainties.
    row_names : Iterable[str]
        The names of the observables corresponding to the rows of the data array.
    col_names : Iterable[str]
        The names of the observables corresponding to the columns of the data array.
    _instances : Dict[str, Dict[str, 'ExperimentalCorrelations']]
        A class-level dictionary to hold instances of `ExperimentalCorrelations` for each data type.
    _covariance_scaled : Dict[str, jnp.ndarray]
        A class-level dictionary to hold scaled covariance matrices for each unique combination of measurements and observables.
    _observable_names : Iterable[Iterable[str]]
        A class-level iterable to hold lists of observable names for each observable sector.

    Methods
    -------
    load() -> None
        Load observable names from all correlated observable sectors and initialize class-level attributes.
    compute(include_measurements: Iterable[str], n_samples: int = int(1e6), seed: int = None) -> None
        Compute the correlation matrices, central values, and uncertainties from the specified measurements.
    get_data(data_type: str, include_measurements: Iterable[str], row_names: Iterable[str], col_names: Optional[Iterable[str]] = []) -> Optional[jnp.ndarray]
        Retrieve the data array for the specified data type and combination of observables and measurements.
    get_cov_scaled(include_measurements: Iterable[str], row_names: Iterable[str], col_names: Iterable[str], std_exp_scaled_row: np.ndarray, std_exp_scaled_col: np.ndarray) -> jnp.ndarray
        Retrieve the scaled covariance matrix for the specified combination of observables and measurements.

    Examples
    --------
    Load observable names from all correlated observable sectors:

    >>> ExperimentalCorrelations.load()

    Compute the correlation matrices, central values, and uncertainties from the specified measurements:

    >>> ExperimentalCorrelations.compute(include_measurements=['measurement1', 'measurement2'])

    Retrieve the data array for a specific data type and combination of observables and measurements:

    >>> data = ExperimentalCorrelations.get_data(data_type='correlations', include_measurements=['measurement1'], row_names=['observable1'], col_names=['observable2'])

    Retrieve the scaled covariance matrix for a specific combination of observables and measurements:

    >>> cov_scaled = ExperimentalCorrelations.get_cov_scaled(include_measurements=['measurement1'], row_names=['observable1'], col_names=['observable2'], std_exp_scaled_row=np.array([0.5]), std_exp_scaled_col=np.array([1.2]))
    '''

    _instances: Dict[str, Dict[str, 'ExperimentalCorrelations']] = {
        'correlations': {},
        'central': {},
        'uncertainties': {}
    }  # Dictionary to hold instances of ExperimentalCorrelations
    _covariance_scaled: Dict[str, jnp.ndarray] = {}
    _observable_names: Iterable[Iterable[str]] = []

    def __init__(
        self,
        hash_val: str,
        data_type: str,
        data: np.ndarray,
        row_names: Iterable[str],
        col_names: Iterable[str],
    ) -> None:
        '''
        Initialize an instance of the ExperimentalCorrelations class.

        Parameters
        ----------
        hash_val : str
            A unique hash value representing the combination of measurements and observables.
        data_type : str
            The type of data stored in the instance. It can be `correlations`, `central`, or `uncertainties`.
        data : np.ndarray
            The data array containing the correlation matrix, central values, or uncertainties.
        row_names : Iterable[str]
            The names of the observables corresponding to the rows of the data array.
        col_names : Iterable[str]
            The names of the observables corresponding to the columns of the data array.

        Returns
        -------
        None

        Examples
        --------
        Initialize an instance of the ExperimentalCorrelations class:

        >>> exp_corr = ExperimentalCorrelations(
        ...     hash_val='unique_hash_value',
        ...     data_type='correlations',
        ...     data=np.array([[1.0, 0.5], [0.5, 1.0]]),
        ...     row_names=['observable1', 'observable2'],
        ...     col_names=['observable1', 'observable2']
        ... )
        '''
        self.hash_val = hash_val
        self.data_type = data_type
        self.data = data
        self.row_names = row_names
        self.col_names = col_names
        self._instances[data_type][hash_val] = self

    @classmethod
    def load(cls) -> None:
        '''
        Load observable names from all correlated observable sectors and initialize class-level attributes.

        Returns
        -------
        None

        Examples
        --------
        Load observable names from all correlated observable sectors:

        >>> ExperimentalCorrelations.load()
        '''
        observable_names = []
        for observable_sector in ObservableSector.get_all():
            if observable_sector.observable_uncertainties is not None:
                observable_names.append(observable_sector.observable_names)
        cls._observable_names = observable_names
        cls._instances = {
            'correlations': {},
            'central': {},
            'uncertainties': {}
        }
        cls._covariance_scaled = {}

    @classmethod
    def compute(
        cls,
        include_measurements: Iterable[str],
        n_samples: int = int(1e6),
        seed: int = None
    ) -> None:
        '''
        Compute the correlation matrices, central values, and uncertainties from the specified measurements.

        Parameters
        ----------
        include_measurements : Iterable[str]
            A list of measurement names to include in the computation.
        n_samples : int, optional
            The number of samples to draw for numerical approximations of non-Gaussian distributions. Default is 1e6.
        seed : int, optional
            A random seed for reproducibility of the samples. Default is None.

        Returns
        -------
        None

        Examples
        --------
        Compute the correlation matrices, central values, and uncertainties from the specified measurements:

        >>> ExperimentalCorrelations.compute(include_measurements=['measurement1', 'measurement2'])
        '''

        observables = list(chain.from_iterable(cls._observable_names))

        # get univariate constraints and combine them for each observable
        constraints_univariate = Measurement.get_constraints(
            observables,
            distribution_types=[k for k in logpdf_functions.keys() if k not in ['MultivariateNormalDistribution']],
            include_measurements=include_measurements,
            )
        constraints_list = []
        for observable in observables:
            constraints_observable = {}
            for dist_type, dist_info in constraints_univariate.items():
                mask = dist_info['observables'] == observable
                if np.any(mask):
                    constraints_observable[dist_type] = {
                        k: v[mask] for k, v in dist_info.items()
                    }
            if constraints_observable:
                constraints_list.append(constraints_observable)
        constraints_univariate = Measurement.combine_constraints(constraints_list)

        # construct covariance matrix and mean vector from univariate constraints
        cov = np.diag([np.inf] * len(observables))
        mean = np.zeros(len(observables))
        for dist_type, dist_info in constraints_univariate.items():
            if dist_type == 'NormalDistribution' or dist_type == 'HalfNormalDistribution':  # replace HalfNormalDistribution with zero-mean NormalDistribution
                central_value = dist_info['central_value']
                standard_deviation = dist_info['standard_deviation']
                observable_indices = dist_info['observable_indices']
            else:  # numerically obtain the Gaussian approximation
                samples = get_distribution_samples(dist_type, dist_info, n_samples, seed)
                central_value = np.mean(samples, axis=1)
                standard_deviation = np.std(samples, axis=1)
                observable_indices = dist_info['observable_indices']
            cov[observable_indices, observable_indices] = standard_deviation**2
            mean[observable_indices] = central_value

        # get multivariate constraints
        constraints_multivariate = Measurement.get_constraints(
            observables,
            distribution_types=['MultivariateNormalDistribution'],
            include_measurements=include_measurements,
        )
        if constraints_multivariate:
            # combine all covariance matrices and mean vectors using the weighted average
            weights = [np.diag(1/np.diag(cov))]
            means = [mean]
            constraints_multivariate = constraints_multivariate['MultivariateNormalDistribution']
            for i in range(len(constraints_multivariate['central_value'])):
                weight_i = np.zeros((len(observables), len(observables)))
                mean_i = np.zeros(len(observables))
                observable_indices_i = constraints_multivariate['observable_indices'][i]
                central_value_i = constraints_multivariate['central_value'][i]
                standard_deviation_i = constraints_multivariate['standard_deviation'][i]
                inverse_correlation_i = constraints_multivariate['inverse_correlation'][i]
                weight_i[np.ix_(observable_indices_i, observable_indices_i)] = inverse_correlation_i / np.outer(standard_deviation_i, standard_deviation_i)
                mean_i[observable_indices_i] = central_value_i
                weights.append(weight_i)
                means.append(mean_i)
            inv_cov = np.sum(weights, axis=0)
            # regularize inversion asuming inv_cov = D R D where R has unit diagonal, then invert R instead of inv_cov
            d = np.sqrt(np.diag(inv_cov))
            nonzero = d != 0  # unconstrained observables have zeros
            inv_cov = inv_cov[np.ix_(nonzero, nonzero)]
            d = d[nonzero]
            d2 = np.outer(d, d)
            R = inv_cov / d2
            inv_R = np.linalg.inv(R)
            cov = np.diag([np.nan] * len(nonzero))
            cov[np.ix_(nonzero, nonzero)] = inv_R / d2
            mean = cov @ np.sum([w @ m for w, m in zip(weights, means)], axis=0)
        else:
            cov[cov == np.inf] = np.nan
        std = np.sqrt(np.diag(cov))
        corr = cov / np.outer(std, std)

        for i, row_names in enumerate(cls._observable_names):
            row_measurements = Measurement.get_measurements(row_names, include_measurements=include_measurements)
            row_idx = [observables.index(o) for o in row_names]
            hash_val = hash_names(row_measurements, row_names)
            cls(
                hash_val=hash_val,
                data_type='central',
                data=jnp.array(mean[row_idx], dtype=jnp.float64),
                row_names=row_names,
                col_names=[],
            )
            cls(
                hash_val=hash_val,
                data_type='uncertainties',
                data=jnp.array(std[row_idx], dtype=jnp.float64),
                row_names=row_names,
                col_names=[],
            )
            for j in range(i, len(cls._observable_names)):
                col_names = cls._observable_names[j]
                col_measurements = Measurement.get_measurements(col_names, include_measurements=include_measurements)
                col_idx = [observables.index(o) for o in col_names]
                hash_val = hash_names(row_measurements, col_measurements, row_names, col_names)
                cls(
                    hash_val=hash_val,
                    data_type='correlations',
                    data=jnp.array(corr[np.ix_(row_idx, col_idx)], dtype=jnp.float64),
                    row_names=row_names,
                    col_names=col_names,
                )

    @classmethod
    def get_data(
        cls,
        data_type: str,
        include_measurements: Iterable[str],
        row_names: Iterable[str],
        col_names: Optional[Iterable[str]] = []
    ):
        '''
        Retrieve the data array for the specified data type and combination of observables and measurements.

        Parameters
        ----------
        data_type : str
            The type of data to retrieve. It can be `correlations`, `central`, or `uncertainties`.
        include_measurements : Iterable[str]
            A list of measurement names to include in the retrieval.
        row_names : Iterable[str]
            The names of the observables corresponding to the rows of the data array.
        col_names : Iterable[str], optional
            The names of the observables corresponding to the columns of the data array. Default is an empty list.

        Returns
        -------
        Optional[jnp.ndarray]
            The data array for the specified data type and combination of observables and measurements, or `None` if not found.

        Examples
        --------
        Retrieve the data array for a specific data type and combination of observables and measurements:

        >>> data = ExperimentalCorrelations.get_data(
        ...     data_type='correlations',
        ...     include_measurements=['measurement1'],
        ...     row_names=['observable1'],
        ...     col_names=['observable2']
        ... )
        '''
        row_measurements = Measurement.get_measurements(row_names, include_measurements=include_measurements)
        col_measurements = Measurement.get_measurements(col_names, include_measurements=include_measurements)
        hash_val = hash_names(row_measurements, col_measurements, row_names, col_names)
        hash_val_transposed = hash_names(col_measurements, row_measurements, col_names, row_names)
        if hash_val not in cls._instances[data_type] and hash_val_transposed not in cls._instances[data_type]:
            cls.compute(include_measurements)
        if hash_val in cls._instances[data_type]:
            return cls._instances[data_type][hash_val].data
        elif hash_val_transposed in cls._instances[data_type]:
            return cls._instances[data_type][hash_val_transposed].data.T
        else:
            return None

    @classmethod
    def get_cov_scaled(
        cls,
        include_measurements: Iterable[str],
        row_names: Iterable[str],
        col_names: Iterable[str],
        std_exp_scaled_row: np.ndarray,
        std_exp_scaled_col: np.ndarray,
    ):
        '''
        Retrieve the scaled covariance matrix for the specified combination of observables and measurements.

        Parameters
        ----------
        include_measurements : Iterable[str]
            A list of measurement names to include in the retrieval.
        row_names : Iterable[str]
            The names of the observables corresponding to the rows of the covariance matrix.
        col_names : Iterable[str]
            The names of the observables corresponding to the columns of the covariance matrix.
        std_exp_scaled_row : np.ndarray
            The experimental standard deviations for the row observables, scaled by any additional factors.
        std_exp_scaled_col : np.ndarray
            The experimental standard deviations for the column observables, scaled by any additional factors.

        Returns
        -------
        jnp.ndarray
            The scaled covariance matrix for the specified combination of observables and measurements.

        Examples
        --------
        Retrieve the scaled covariance matrix for a specific combination of observables and measurements:

        >>> cov_scaled = ExperimentalCorrelations.get_cov_scaled(
        ...     include_measurements=['measurement1'],
        ...     row_names=['observable1'],
        ...     col_names=['observable2'],
        ...     std_exp_scaled_row=np.array([0.5]),
        ...     std_exp_scaled_col=np.array([1.2])
        ... )
        '''
        row_measurements = Measurement.get_measurements(row_names, include_measurements=include_measurements)
        col_measurements = Measurement.get_measurements(col_names, include_measurements=include_measurements)
        hash_val = hash_names(row_measurements, col_measurements, row_names, col_names)
        if hash_val in cls._covariance_scaled:
            cov_scaled = cls._covariance_scaled[hash_val]
        else:
            corr = cls.get_data('correlations', include_measurements, row_names, col_names)
            if corr is None:
                raise ValueError(f"Correlation data for {row_names} and {col_names} not found.")
            cov_scaled = corr * np.outer(std_exp_scaled_row, std_exp_scaled_col)
            cov_scaled = jnp.array(cov_scaled, dtype=jnp.float64)
            cls._covariance_scaled[hash_val] = cov_scaled
        return cov_scaled

`init(hash_val, data_type, data, row_names, col_names)`

Initialize an instance of the ExperimentalCorrelations class.

Parameters:

Name	Type	Description	Default
`hash_val`	`str`	A unique hash value representing the combination of measurements and observables.	required
`data_type`	`str`	The type of data stored in the instance. It can be `correlations`, `central`, or `uncertainties`.	required
`data`	`ndarray`	The data array containing the correlation matrix, central values, or uncertainties.	required
`row_names`	`Iterable[str]`	The names of the observables corresponding to the rows of the data array.	required
`col_names`	`Iterable[str]`	The names of the observables corresponding to the columns of the data array.	required

Returns:

Type	Description
`None`

Examples:

Initialize an instance of the ExperimentalCorrelations class:

>>> exp_corr = ExperimentalCorrelations(
...     hash_val='unique_hash_value',
...     data_type='correlations',
...     data=np.array([[1.0, 0.5], [0.5, 1.0]]),
...     row_names=['observable1', 'observable2'],
...     col_names=['observable1', 'observable2']
... )

Source code in jelli/core/experimental_correlations.py

def __init__(
    self,
    hash_val: str,
    data_type: str,
    data: np.ndarray,
    row_names: Iterable[str],
    col_names: Iterable[str],
) -> None:
    '''
    Initialize an instance of the ExperimentalCorrelations class.

    Parameters
    ----------
    hash_val : str
        A unique hash value representing the combination of measurements and observables.
    data_type : str
        The type of data stored in the instance. It can be `correlations`, `central`, or `uncertainties`.
    data : np.ndarray
        The data array containing the correlation matrix, central values, or uncertainties.
    row_names : Iterable[str]
        The names of the observables corresponding to the rows of the data array.
    col_names : Iterable[str]
        The names of the observables corresponding to the columns of the data array.

    Returns
    -------
    None

    Examples
    --------
    Initialize an instance of the ExperimentalCorrelations class:

    >>> exp_corr = ExperimentalCorrelations(
    ...     hash_val='unique_hash_value',
    ...     data_type='correlations',
    ...     data=np.array([[1.0, 0.5], [0.5, 1.0]]),
    ...     row_names=['observable1', 'observable2'],
    ...     col_names=['observable1', 'observable2']
    ... )
    '''
    self.hash_val = hash_val
    self.data_type = data_type
    self.data = data
    self.row_names = row_names
    self.col_names = col_names
    self._instances[data_type][hash_val] = self

`compute(include_measurements, n_samples=int(1000000.0), seed=None)` `classmethod`

Compute the correlation matrices, central values, and uncertainties from the specified measurements.

Parameters:

Name	Type	Description	Default
`include_measurements`	`Iterable[str]`	A list of measurement names to include in the computation.	required
`n_samples`	`int`	The number of samples to draw for numerical approximations of non-Gaussian distributions. Default is 1e6.	`int(1000000.0)`
`seed`	`int`	A random seed for reproducibility of the samples. Default is None.	`None`

Returns:

Type	Description
`None`

Examples:

Compute the correlation matrices, central values, and uncertainties from the specified measurements:

>>> ExperimentalCorrelations.compute(include_measurements=['measurement1', 'measurement2'])

Source code in jelli/core/experimental_correlations.py

@classmethod
def compute(
    cls,
    include_measurements: Iterable[str],
    n_samples: int = int(1e6),
    seed: int = None
) -> None:
    '''
    Compute the correlation matrices, central values, and uncertainties from the specified measurements.

    Parameters
    ----------
    include_measurements : Iterable[str]
        A list of measurement names to include in the computation.
    n_samples : int, optional
        The number of samples to draw for numerical approximations of non-Gaussian distributions. Default is 1e6.
    seed : int, optional
        A random seed for reproducibility of the samples. Default is None.

    Returns
    -------
    None

    Examples
    --------
    Compute the correlation matrices, central values, and uncertainties from the specified measurements:

    >>> ExperimentalCorrelations.compute(include_measurements=['measurement1', 'measurement2'])
    '''

    observables = list(chain.from_iterable(cls._observable_names))

    # get univariate constraints and combine them for each observable
    constraints_univariate = Measurement.get_constraints(
        observables,
        distribution_types=[k for k in logpdf_functions.keys() if k not in ['MultivariateNormalDistribution']],
        include_measurements=include_measurements,
        )
    constraints_list = []
    for observable in observables:
        constraints_observable = {}
        for dist_type, dist_info in constraints_univariate.items():
            mask = dist_info['observables'] == observable
            if np.any(mask):
                constraints_observable[dist_type] = {
                    k: v[mask] for k, v in dist_info.items()
                }
        if constraints_observable:
            constraints_list.append(constraints_observable)
    constraints_univariate = Measurement.combine_constraints(constraints_list)

    # construct covariance matrix and mean vector from univariate constraints
    cov = np.diag([np.inf] * len(observables))
    mean = np.zeros(len(observables))
    for dist_type, dist_info in constraints_univariate.items():
        if dist_type == 'NormalDistribution' or dist_type == 'HalfNormalDistribution':  # replace HalfNormalDistribution with zero-mean NormalDistribution
            central_value = dist_info['central_value']
            standard_deviation = dist_info['standard_deviation']
            observable_indices = dist_info['observable_indices']
        else:  # numerically obtain the Gaussian approximation
            samples = get_distribution_samples(dist_type, dist_info, n_samples, seed)
            central_value = np.mean(samples, axis=1)
            standard_deviation = np.std(samples, axis=1)
            observable_indices = dist_info['observable_indices']
        cov[observable_indices, observable_indices] = standard_deviation**2
        mean[observable_indices] = central_value

    # get multivariate constraints
    constraints_multivariate = Measurement.get_constraints(
        observables,
        distribution_types=['MultivariateNormalDistribution'],
        include_measurements=include_measurements,
    )
    if constraints_multivariate:
        # combine all covariance matrices and mean vectors using the weighted average
        weights = [np.diag(1/np.diag(cov))]
        means = [mean]
        constraints_multivariate = constraints_multivariate['MultivariateNormalDistribution']
        for i in range(len(constraints_multivariate['central_value'])):
            weight_i = np.zeros((len(observables), len(observables)))
            mean_i = np.zeros(len(observables))
            observable_indices_i = constraints_multivariate['observable_indices'][i]
            central_value_i = constraints_multivariate['central_value'][i]
            standard_deviation_i = constraints_multivariate['standard_deviation'][i]
            inverse_correlation_i = constraints_multivariate['inverse_correlation'][i]
            weight_i[np.ix_(observable_indices_i, observable_indices_i)] = inverse_correlation_i / np.outer(standard_deviation_i, standard_deviation_i)
            mean_i[observable_indices_i] = central_value_i
            weights.append(weight_i)
            means.append(mean_i)
        inv_cov = np.sum(weights, axis=0)
        # regularize inversion asuming inv_cov = D R D where R has unit diagonal, then invert R instead of inv_cov
        d = np.sqrt(np.diag(inv_cov))
        nonzero = d != 0  # unconstrained observables have zeros
        inv_cov = inv_cov[np.ix_(nonzero, nonzero)]
        d = d[nonzero]
        d2 = np.outer(d, d)
        R = inv_cov / d2
        inv_R = np.linalg.inv(R)
        cov = np.diag([np.nan] * len(nonzero))
        cov[np.ix_(nonzero, nonzero)] = inv_R / d2
        mean = cov @ np.sum([w @ m for w, m in zip(weights, means)], axis=0)
    else:
        cov[cov == np.inf] = np.nan
    std = np.sqrt(np.diag(cov))
    corr = cov / np.outer(std, std)

    for i, row_names in enumerate(cls._observable_names):
        row_measurements = Measurement.get_measurements(row_names, include_measurements=include_measurements)
        row_idx = [observables.index(o) for o in row_names]
        hash_val = hash_names(row_measurements, row_names)
        cls(
            hash_val=hash_val,
            data_type='central',
            data=jnp.array(mean[row_idx], dtype=jnp.float64),
            row_names=row_names,
            col_names=[],
        )
        cls(
            hash_val=hash_val,
            data_type='uncertainties',
            data=jnp.array(std[row_idx], dtype=jnp.float64),
            row_names=row_names,
            col_names=[],
        )
        for j in range(i, len(cls._observable_names)):
            col_names = cls._observable_names[j]
            col_measurements = Measurement.get_measurements(col_names, include_measurements=include_measurements)
            col_idx = [observables.index(o) for o in col_names]
            hash_val = hash_names(row_measurements, col_measurements, row_names, col_names)
            cls(
                hash_val=hash_val,
                data_type='correlations',
                data=jnp.array(corr[np.ix_(row_idx, col_idx)], dtype=jnp.float64),
                row_names=row_names,
                col_names=col_names,
            )

`get_cov_scaled(include_measurements, row_names, col_names, std_exp_scaled_row, std_exp_scaled_col)` `classmethod`

Retrieve the scaled covariance matrix for the specified combination of observables and measurements.

Parameters:

Name	Type	Description	Default
`include_measurements`	`Iterable[str]`	A list of measurement names to include in the retrieval.	required
`row_names`	`Iterable[str]`	The names of the observables corresponding to the rows of the covariance matrix.	required
`col_names`	`Iterable[str]`	The names of the observables corresponding to the columns of the covariance matrix.	required
`std_exp_scaled_row`	`ndarray`	The experimental standard deviations for the row observables, scaled by any additional factors.	required
`std_exp_scaled_col`	`ndarray`	The experimental standard deviations for the column observables, scaled by any additional factors.	required

Returns:

Type	Description
`ndarray`	The scaled covariance matrix for the specified combination of observables and measurements.

Examples:

Retrieve the scaled covariance matrix for a specific combination of observables and measurements:

>>> cov_scaled = ExperimentalCorrelations.get_cov_scaled(
...     include_measurements=['measurement1'],
...     row_names=['observable1'],
...     col_names=['observable2'],
...     std_exp_scaled_row=np.array([0.5]),
...     std_exp_scaled_col=np.array([1.2])
... )

Source code in jelli/core/experimental_correlations.py

@classmethod
def get_cov_scaled(
    cls,
    include_measurements: Iterable[str],
    row_names: Iterable[str],
    col_names: Iterable[str],
    std_exp_scaled_row: np.ndarray,
    std_exp_scaled_col: np.ndarray,
):
    '''
    Retrieve the scaled covariance matrix for the specified combination of observables and measurements.

    Parameters
    ----------
    include_measurements : Iterable[str]
        A list of measurement names to include in the retrieval.
    row_names : Iterable[str]
        The names of the observables corresponding to the rows of the covariance matrix.
    col_names : Iterable[str]
        The names of the observables corresponding to the columns of the covariance matrix.
    std_exp_scaled_row : np.ndarray
        The experimental standard deviations for the row observables, scaled by any additional factors.
    std_exp_scaled_col : np.ndarray
        The experimental standard deviations for the column observables, scaled by any additional factors.

    Returns
    -------
    jnp.ndarray
        The scaled covariance matrix for the specified combination of observables and measurements.

    Examples
    --------
    Retrieve the scaled covariance matrix for a specific combination of observables and measurements:

    >>> cov_scaled = ExperimentalCorrelations.get_cov_scaled(
    ...     include_measurements=['measurement1'],
    ...     row_names=['observable1'],
    ...     col_names=['observable2'],
    ...     std_exp_scaled_row=np.array([0.5]),
    ...     std_exp_scaled_col=np.array([1.2])
    ... )
    '''
    row_measurements = Measurement.get_measurements(row_names, include_measurements=include_measurements)
    col_measurements = Measurement.get_measurements(col_names, include_measurements=include_measurements)
    hash_val = hash_names(row_measurements, col_measurements, row_names, col_names)
    if hash_val in cls._covariance_scaled:
        cov_scaled = cls._covariance_scaled[hash_val]
    else:
        corr = cls.get_data('correlations', include_measurements, row_names, col_names)
        if corr is None:
            raise ValueError(f"Correlation data for {row_names} and {col_names} not found.")
        cov_scaled = corr * np.outer(std_exp_scaled_row, std_exp_scaled_col)
        cov_scaled = jnp.array(cov_scaled, dtype=jnp.float64)
        cls._covariance_scaled[hash_val] = cov_scaled
    return cov_scaled

`get_data(data_type, include_measurements, row_names, col_names=[])` `classmethod`

Retrieve the data array for the specified data type and combination of observables and measurements.

Parameters:

Name	Type	Description	Default
`data_type`	`str`	The type of data to retrieve. It can be `correlations`, `central`, or `uncertainties`.	required
`include_measurements`	`Iterable[str]`	A list of measurement names to include in the retrieval.	required
`row_names`	`Iterable[str]`	The names of the observables corresponding to the rows of the data array.	required
`col_names`	`Iterable[str]`	The names of the observables corresponding to the columns of the data array. Default is an empty list.	`[]`

Returns:

Type	Description
`Optional[ndarray]`	The data array for the specified data type and combination of observables and measurements, or `None` if not found.

Examples:

Retrieve the data array for a specific data type and combination of observables and measurements:

>>> data = ExperimentalCorrelations.get_data(
...     data_type='correlations',
...     include_measurements=['measurement1'],
...     row_names=['observable1'],
...     col_names=['observable2']
... )

Source code in jelli/core/experimental_correlations.py

@classmethod
def get_data(
    cls,
    data_type: str,
    include_measurements: Iterable[str],
    row_names: Iterable[str],
    col_names: Optional[Iterable[str]] = []
):
    '''
    Retrieve the data array for the specified data type and combination of observables and measurements.

    Parameters
    ----------
    data_type : str
        The type of data to retrieve. It can be `correlations`, `central`, or `uncertainties`.
    include_measurements : Iterable[str]
        A list of measurement names to include in the retrieval.
    row_names : Iterable[str]
        The names of the observables corresponding to the rows of the data array.
    col_names : Iterable[str], optional
        The names of the observables corresponding to the columns of the data array. Default is an empty list.

    Returns
    -------
    Optional[jnp.ndarray]
        The data array for the specified data type and combination of observables and measurements, or `None` if not found.

    Examples
    --------
    Retrieve the data array for a specific data type and combination of observables and measurements:

    >>> data = ExperimentalCorrelations.get_data(
    ...     data_type='correlations',
    ...     include_measurements=['measurement1'],
    ...     row_names=['observable1'],
    ...     col_names=['observable2']
    ... )
    '''
    row_measurements = Measurement.get_measurements(row_names, include_measurements=include_measurements)
    col_measurements = Measurement.get_measurements(col_names, include_measurements=include_measurements)
    hash_val = hash_names(row_measurements, col_measurements, row_names, col_names)
    hash_val_transposed = hash_names(col_measurements, row_measurements, col_names, row_names)
    if hash_val not in cls._instances[data_type] and hash_val_transposed not in cls._instances[data_type]:
        cls.compute(include_measurements)
    if hash_val in cls._instances[data_type]:
        return cls._instances[data_type][hash_val].data
    elif hash_val_transposed in cls._instances[data_type]:
        return cls._instances[data_type][hash_val_transposed].data.T
    else:
        return None

`load()` `classmethod`

Load observable names from all correlated observable sectors and initialize class-level attributes.

Returns:

Type	Description
`None`

Examples:

Load observable names from all correlated observable sectors:

>>> ExperimentalCorrelations.load()

Source code in jelli/core/experimental_correlations.py

@classmethod
def load(cls) -> None:
    '''
    Load observable names from all correlated observable sectors and initialize class-level attributes.

    Returns
    -------
    None

    Examples
    --------
    Load observable names from all correlated observable sectors:

    >>> ExperimentalCorrelations.load()
    '''
    observable_names = []
    for observable_sector in ObservableSector.get_all():
        if observable_sector.observable_uncertainties is not None:
            observable_names.append(observable_sector.observable_names)
    cls._observable_names = observable_names
    cls._instances = {
        'correlations': {},
        'central': {},
        'uncertainties': {}
    }
    cls._covariance_scaled = {}

jelli.core.experimental_correlations

ExperimentalCorrelations

__init__(hash_val, data_type, data, row_names, col_names)

compute(include_measurements, n_samples=int(1000000.0), seed=None) classmethod

get_cov_scaled(include_measurements, row_names, col_names, std_exp_scaled_row, std_exp_scaled_col) classmethod

get_data(data_type, include_measurements, row_names, col_names=[]) classmethod

load() classmethod

`ExperimentalCorrelations`

`init(hash_val, data_type, data, row_names, col_names)`

`compute(include_measurements, n_samples=int(1000000.0), seed=None)` `classmethod`

`get_cov_scaled(include_measurements, row_names, col_names, std_exp_scaled_row, std_exp_scaled_col)` `classmethod`

`get_data(data_type, include_measurements, row_names, col_names=[])` `classmethod`

`load()` `classmethod`