Variety and panel filtering complex

The variety and panel filtering complex is a special pair of enumerated filtering properties. The complex builds on the support of symbol panels as solution items.

The variety of symbols for some dimensions might be too wide. Gene symbols are the basic example of such a dimension (approximately 20000 items, with only protein-coding genes taken into account), and currently it is the only dimension of this kind supported in the system. Since the nomenclature of symbols is so wide, the user has plenty of reasons to select groups of symbols. These groups form symbol panels, and the system provides functionality to operate on them as solution items. This is the background of the variety/panel complex.

The complex is needed when the nomenclature of symbols is too wide. Gene symbols are the basic example of such a nomenclature, and the system interprets it as a "dimension" named Symbol. There are plenty of reasons to select some of these symbols, form a symbol panel, and use such panels in the filtration process. So the system provides functionality for accessing and modifying symbol panels as solution items, and this is the background functionality behind the variety and panel complex.

A variety filtering unit is a special subtype of enumerated unit that collects statistics (see Status report mechanism (with delays)) in a special restricted way. In contrast to an ordinary enumerated property, the statistics for this property do not include the whole list of property values (symbols):

  • only active symbols are traced in complete form;

    • by default, active symbols form a special hidden symbol panel, see details here

    • the active symbol list can be controlled manually with the ctx argument

  • other symbols are traced only if they are actually present in the applied set (that is, have a positive count; symbols with a zero count are ignored), and only if the list of these symbols is short enough (currently up to 300)
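The restricted collection rule above can be illustrated with a minimal Python sketch. It is not the system's implementation: the function name variety_stat, the shape of the returned dictionary, and the choice to return None when the list of other symbols is too long are all assumptions made for illustration; only the limit of 300 comes from the text.

```python
from collections import Counter

MAX_REST_SYMBOLS = 300  # limit quoted in the text; may change between versions


def variety_stat(records, active_symbols, max_rest=MAX_REST_SYMBOLS):
    """Hypothetical sketch of restricted statistics for a variety unit.

    records: iterable of symbol collections, one per record of the applied set.
    active_symbols: symbols that must always be reported, even with count 0.
    """
    active_set = set(active_symbols)
    counts = Counter()
    for symbols in records:
        # Count each symbol once per record.
        counts.update(set(symbols))

    # Active symbols are reported in complete form, zero counts included.
    active = [(sym, counts.get(sym, 0)) for sym in sorted(active_set)]

    # Other symbols appear only with a positive count (Counter holds no zeros).
    rest = sorted(
        (sym, cnt) for sym, cnt in counts.items() if sym not in active_set
    )
    # Assumption: when the list is too long, it is omitted entirely.
    if len(rest) > max_rest:
        rest = None
    return {"active": active, "rest": rest}
```

In this sketch an active symbol that is absent from the applied set still appears with a count of 0, while a non-active symbol never does.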

A panel filtering unit is simply an enumerated unit that represents the presence of symbols from the variety unit (all symbols, not only active ones) in the available symbol panels.
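As a rough illustration of what a panel unit value means for a single record, the following sketch maps a record's symbols to the names of the panels that contain at least one of them. The function name panel_values, the dictionary layout of panels, and the panel names are hypothetical and not taken from the system.

```python
def panel_values(record_symbols, panels):
    """Return names of panels containing at least one symbol of the record.

    record_symbols: symbols the variety unit holds for one record.
    panels: mapping of panel name to the symbols in that panel (assumed layout).
    """
    record_set = set(record_symbols)
    return sorted(
        name for name, members in panels.items() if record_set & set(members)
    )
```

For example, with panels {"Cancer": ["BRCA1", "TP53"], "Hearing": ["GJB2"]}, a record carrying TP53 and EGFR gets the single panel value Cancer, whether or not TP53 is an active symbol.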

Currently the system provides two pairs of units:

Initially, the list of active symbols for a new dataset is empty, and it is the user's responsibility to specify the symbols of interest. The application can help the user fill in this list, however. At the REST API level, two requests (ds_stat, dtree_set) accept the parameter actsym; if this option is on, all symbols used in filtration tools are automatically added to the active symbol list. Other helpful features can be implemented in the Front-End application.
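The effect of the actsym option can be pictured with the sketch below. It only models the idea of adding symbols used in filtration tools to the active list; it does not call the REST API. The function name extend_active_symbols and the representation of conditions as (dimension, values) pairs are assumptions, and only conditions on the Symbol dimension are handled here.

```python
def extend_active_symbols(active, conditions):
    """Append symbols used in filter conditions to the active list.

    active: current list of active symbols (order is preserved).
    conditions: iterable of (dimension, values) pairs; a hypothetical,
        simplified stand-in for the conditions used in filtration tools.
    """
    result = list(active)
    seen = set(result)
    for dimension, values in conditions:
        if dimension != "Symbol":
            # Conditions on other dimensions are ignored in this sketch.
            continue
        for sym in values:
            if sym not in seen:
                seen.add(sym)
                result.append(sym)
    return result
```

Keeping the existing order and appending new symbols at the end is a design choice of the sketch, so that a list the user has already arranged is not reshuffled.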

Technical notes:

  • There is a technical difference between panel units and ordinary enumerated ones. Statistics for a panel and variety unit pair are evaluated in a single procedure, so the real statistics for panels are part of the statistical report of the variety unit, and the formal statistics for the panel unit are kept empty.

  • Both pairs of the variety/panel complex in the current version use the same named dimensions: variety units use the named dimension Symbol, and panel units use panel.Symbol. This markup feature makes it possible to detect the usage of specific symbols and panels in filtration tools.

  • The technical name of the hidden panel of active symbols for the dimension Symbol is __Symbol__