DataProfiler

Streamline Data Analysis and Detect Sensitive Information with Python

Product Description

DataProfiler is a Python library for data analysis and sensitive data detection. It loads file types such as CSV, JSON, and Parquet into Pandas DataFrames, then profiles the data to identify its schema, statistics, and sensitive elements such as PII/NPI. Setup is straightforward, and the library ships with a pre-trained deep learning model that can be extended with new entities or new entity recognition pipelines. DataProfiler is suited to automated data monitoring and report generation, and it can be integrated into existing workflows.
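To make the idea of a profile concrete, here is a minimal sketch of a per-column report with a data type, a null count, and sensitive-data labels. It uses only the Python standard library and is not DataProfiler's API or implementation: the names `profile_csv`, `infer_type`, and `SENSITIVE_PATTERNS` are hypothetical, and simple regular expressions stand in for the library's pre-trained deep learning model.

```python
import csv
import io
import re

# Hypothetical regex patterns standing in for a trained entity-recognition model.
SENSITIVE_PATTERNS = {
    "EMAIL": re.compile(r"^[\w.+-]+@[\w-]+\.[\w.-]+$"),
    "SSN": re.compile(r"^\d{3}-\d{2}-\d{4}$"),
}


def infer_type(values):
    """Return 'int', 'float', or 'string' for a list of non-empty strings."""
    for name, cast in (("int", int), ("float", float)):
        try:
            for value in values:
                cast(value)
            return name
        except ValueError:
            continue
    return "string"


def profile_csv(text):
    """Build a per-column summary: data type, null count, sensitive labels."""
    rows = list(csv.DictReader(io.StringIO(text)))
    report = {}
    for column in (rows[0].keys() if rows else []):
        raw = [row[column] for row in rows]
        present = [value for value in raw if value not in ("", None)]
        # A column gets a label only if every non-empty value matches the pattern.
        labels = sorted(
            label
            for label, pattern in SENSITIVE_PATTERNS.items()
            if present and all(pattern.match(value) for value in present)
        )
        report[column] = {
            "data_type": infer_type(present) if present else "unknown",
            "null_count": len(raw) - len(present),
            "sensitive_labels": labels,
        }
    return report


# Made-up sample data for illustration only.
SAMPLE = """id,email,score
1,ana@example.com,9.5
2,,7
3,li@example.org,8.25
"""

report = profile_csv(SAMPLE)
for column, summary in report.items():
    print(column, summary)
```

A learned model generalizes to free text and messy formats in a way fixed patterns cannot, so treat the regex step here as a placeholder for entity recognition rather than a substitute for it.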
Project Details