Offline Usage
FLUTE requires Python installed on local machine (macOS, Linux and Windows are all supported). If users want to explore the interactive notebook we provided locally, Jupyter notebook installation is also required.
Clone the FLUTE repository, to your computer.
git clone https://github.com/pitt-miskov-zivanov-lab/FLUTE.git
Navigate into the directory, install FLUTE and its python dependencies.
cd FLUTE
pip install -e .
Use either in terminal or via notebook.
3.1 In terminal: change working directory to src/ folder by cd src/, first unzip a large file unzip ../supplementary/oa_file_list.txt.zip -d "../examples/input/", and then see what parameters are needed in run_FLUTE.py via helper
python run_FLUTE.py -h
>> usage: run_FLUTE.py [-h] [-g] [-p] [-q [PROT_QUERY]] [-r] username password host db_name input_file output_path
That’s being said, six ordered positional arguments are required: username password host db_name input_file output_path,
some optional flag/arguments can be appended to support additional features:
More Arguments |
Examples |
Descriptions |
|---|---|---|
No optional ones |
|
Filter and score input interactions |
With |
|
Also output duplicate interactions in input file |
With |
|
Also output interactions published in papers of recent years |
With |
|
Also find interactions related to these given protein IDs |
With |
|
Also find additional interactions related that are in the same paper set |
Combine |
|
Do both |
3.2 Via notebook: Run the provided notebook after cd examples/ from FLUTE root directory. Detailed instructions can be found in markdown cells inside the notebook or at this page
jupyter notebook examples/use_FLUTE.ipynb
Prepare input files and interpret output files
4.1 Input and parameters include:
a
.xlsxfile containing the machine reading interactions, best in BioRECIPES interaction format, see example. At the minimum requirements, it shall have the following headers:
Regulated Name
Regulated ID
Regulated Type
Regulator Name
Regulator ID
Regulator Type
Paper IDs
a score tuple
(es, ts, ds), three positive numbers, denoting thresholds of escore, tscore, dscore. Default value is set to be (0,0,0), update it inmain()(run in terminal) or in Cell 18 and 27 (run with notebook).If using the feature of returning recent papers, integer
xis needed to specify how many years to return interactions published in recent papers. Default is 5, update it inmain()(run in terminal) or in Cell 38 (run with notebook).If using the feature of querying protein IDs and returning related interactions, string of protein IDs is needed, e.g.,
P00533,P03386, supported after-q(run in terminal) or in Cell 32 (run with notebook).
4.2 Outputs (including optional ones) include:
list of reading interactions that pass filtration, see example
the filtration scores for those filtered interactions, see example
OPTIONAL: duplicated reading interactions and their occurrences, see example
OPTIONAL: interactions published in recent x years, see example
OPTIONAL: interactions related to query protein IDs, see example
OPTIONAL: interactions that are in the same papers as the input file, see example