Metadata-Version: 2.4
Name: motifscan
Version: 1.3.0
Summary: A package for motif discovery and motif enrichment analysis
Home-page: https://github.com/shao-lab/MotifScan
Author: Hayden Sun
Author-email: sunhongduo@picb.ac.cn
License: BSD
Project-URL: Bug Tracker, https://github.com/shao-lab/MotifScan/issues
Project-URL: Documentation, https://motifscan.readthedocs.io
Project-URL: Source Code, https://github.com/shao-lab/MotifScan
Classifier: Development Status :: 5 - Production/Stable
Classifier: Environment :: Console
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: BSD License
Classifier: Operating System :: Unix
Classifier: Operating System :: POSIX
Classifier: Operating System :: MacOS
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Topic :: Scientific/Engineering :: Bio-Informatics
Requires-Python: >=3.6
Description-Content-Type: text/x-rst
License-File: LICENSE
Requires-Dist: numpy>=1.15
Requires-Dist: scipy>=1.0
Requires-Dist: requests
Requires-Dist: tqdm>=4.42.1
Requires-Dist: pysam>=0.15.0
Requires-Dist: matplotlib>=3.0.0
Provides-Extra: test
Requires-Dist: pytest>=4.0.0; extra == "test"
Requires-Dist: pytest-cov>=2.8.0; extra == "test"
Provides-Extra: docs
Requires-Dist: sphinx>=2.0.0; extra == "docs"
Requires-Dist: sphinx_rtd_theme; extra == "docs"
Dynamic: author
Dynamic: author-email
Dynamic: classifier
Dynamic: description
Dynamic: description-content-type
Dynamic: home-page
Dynamic: license
Dynamic: license-file
Dynamic: project-url
Dynamic: provides-extra
Dynamic: requires-dist
Dynamic: requires-python
Dynamic: summary

MotifScan
=========

.. image:: https://github.com/shao-lab/MotifScan/workflows/Python%20package/badge.svg
   :alt: Github Actions
   :target: https://github.com/shao-lab/MotifScan/actions
.. image:: https://readthedocs.org/projects/motifscan/badge/?version=latest
   :alt: Documentation Status
   :target: https://motifscan.readthedocs.io/en/latest/?badge=latest
.. image:: https://img.shields.io/pypi/v/motifscan.svg
   :alt: PyPI
   :target: https://pypi.org/project/motifscan/
.. image:: https://img.shields.io/pypi/pyversions/motifscan.svg
   :alt: Python Version
   :target: https://pypi.org/project/motifscan/
.. image:: https://img.shields.io/badge/install%20with-bioconda-brightgreen.svg?style=flat-square
   :alt: Bioconda
   :target: http://bioconda.github.io/recipes/motifscan/README.html
.. image:: https://codecov.io/gh/shao-lab/MotifScan/branch/master/graph/badge.svg
   :alt: Codecov
   :target: https://codecov.io/gh/shao-lab/MotifScan
.. image:: https://img.shields.io/pypi/l/motifscan.svg
   :alt: License
   :target: https://github.com/shao-lab/MotifScan/blob/master/LICENSE

Introduction
============

**Scan input genomic regions with known DNA motifs**

Given a set of input genomic regions, MotifScan scans the sequences to
detect the occurrences of known motifs. It can also applies a statistical test
on each motif to check whether the motif is significantly over- or under-represented
(enriched or depleted) in the input genomic regions compared to another set of control
regions.

Citation
========

`Sun, H., Wang, J., Gong, Z. et al. Quantitative integration of epigenomic variation and
transcription factor binding using MAmotif toolkit identifies an important role of IRF2
as transcription activator at gene promoters. Cell Discov 4, 38 (2018).`__

.. __: https://doi.org/10.1038/s41421-018-0045-y

Documentation
=============

To see the full documentation of MotifScan, please refer to: https://motifscan.readthedocs.io

Installation
============

The latest version release of MotifScan is available at
`PyPI <https://pypi.python.org/pypi/motifscan>`__:

::

    $ pip install motifscan

Or you can install MotifScan via conda:

::

    $ conda install -c bioconda motifscan

Usage
=====

Install genome assemblies
-------------------------

Install from a remote database
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

You can download genome assemblies from the `UCSC`_ database.

First, display all available genome assemblies:

.. code-block:: shell

    $ motifscan genome --list-remote

Then, install a genome assembly (e.g. hg19):

.. code-block:: shell

    $ motifscan genome --install -n hg19 -r hg19

Install with local files
^^^^^^^^^^^^^^^^^^^^^^^^

To install a genome assembly locally, you have to prepare a FASTA file
containing the genome sequences and a genome annotation file (refGene.txt).

.. code-block:: shell

    $ motifscan genome --install -n hg19 -i <hg19.fa> -a <refGene.txt>


Install and build motif sets
----------------------------

Install from a remote database
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Users can install motif PFMs sets in the `JASPAR`_ 2020 database.

First, display all available motif PFMs sets in JASPAR 2020:

.. code-block:: shell

    $ motifscan motif --list-remote

Then, install a JASPAR motif PFMs set (e.g. vertebrates_non-redundant):

.. code-block:: shell

    $ motifscan motif --install -n <motif_set> -r vertebrates_non-redundant -g hg19


Install with local files
^^^^^^^^^^^^^^^^^^^^^^^^

Install a motif set with local PFMs file:

.. code-block:: shell

   $ motifscan motif --install -n <motif_set> -i <pfms.jaspar> -g hg19

Build PFMs for additional genome
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

Build the motif PFMs set for another installed genome assembly hg38:

.. code-block:: shell

   $ motifscan motif --build <motif_set> -g hg38

Scanning Motifs
---------------

After the data preparation steps, you can now scan a set of genomic regions to
detect the occurrences of known motifs.

.. code-block:: shell

   $ motifscan scan -i regions.bed -g hg19 -m <motif_set> -o <output_dir>

.. _UCSC: https://genome.ucsc.edu/
.. _JASPAR: http://jaspar.genereg.net/

**Note:** Using -h/--help for the details of all arguments.


License
=======

`BSD 3-Clause
License <https://github.com/shao-lab/MotifScan/blob/master/LICENSE>`__
