Metadata-Version: 2.1
Name: py-stringsimjoin
Version: 0.3.2
Summary: Python library for performing string similarity joins.
Home-page: https://sites.google.com/site/anhaidgroup/projects/magellan/py_stringsimjoin
Author: UW Magellan Team
Author-email: uwmagellan@gmail.com
License: BSD
Platform: UNKNOWN
Classifier: Development Status :: 4 - Beta
Classifier: Environment :: Console
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: Intended Audience :: Education
Classifier: License :: OSI Approved :: BSD License
Classifier: Operating System :: POSIX
Classifier: Operating System :: Unix
Classifier: Operating System :: MacOS
Classifier: Operating System :: Microsoft :: Windows
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 2
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 2.7
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Utilities
Classifier: Topic :: Software Development :: Libraries
License-File: LICENSE
Requires-Dist: joblib
Requires-Dist: pandas (>=0.16.0)
Requires-Dist: PyPrind (>=2.9.3)
Requires-Dist: py-stringmatching (>=0.2.1)
Requires-Dist: six

py_stringsimjoin
================

This project seeks to build a Python software package that provides scalable implementation of string similarity joins over two tables, for commonly used similarity measures such as Jaccard, Dice, cosine, overlap, overlap coefficient and edit distance. The package is free, open-source, and BSD-licensed.

Important links
===============

 * Project Homepage: https://sites.google.com/site/anhaidgroup/projects/magellan/py_stringsimjoin
 * Code repository: https://github.com/anhaidgroup/py_stringsimjoin
 * User Manual: http://anhaidgroup.github.io/py_stringsimjoin/v0.3.2/index.html
 * Overview: https://anhaidgroup.github.io/py_stringsimjoin/v0.3.2/overview.html
 * How to Contribute: https://anhaidgroup.github.io/py_stringsimjoin/v0.3.2/contributing.html
 * Issue Tracker: https://github.com/anhaidgroup/py_stringsimjoin/issues
 * Mailing List: https://groups.google.com/forum/#!forum/py_stringsimjoin

Dependencies
============

py_stringsimjoin has been tested on Python 2.7, 3.5, 3.6, and 3.7.

The required dependencies to build the package are pandas 0.16.0 or higher, py_stringmatching 0.2.1 or higher,
joblib, pyprind, six and a C++ compiler. For the development version, you will also need Cython.

Platforms
=========

py_stringsimjoin has been tested on Linux, OS X and Windows.


