node-talon

A JavaScript port of mailgun/talon, a Python library for detecting signatures in emails.

It can be used to extract and strip signatures from email messages, which makes automated parsing of the message body safer, for example.

Installation

npm install --save talon

Usage

var talon = require("talon");
var extractSignature = talon.signature.bruteforce.extractSignature;

var message = "Wow. Awesome!\n--\nBob Smith";

console.log(extractSignature(message));

Expected output:

{ text: 'Wow. Awesome!', signature: '--\nBob Smith' }
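
For intuition about what a delimiter-based heuristic might do with the message above, here is a minimal sketch. It is not the code used by talon or node-talon: the function name sketchExtractSignature, the single "line made only of dashes" rule, and the null signature returned when nothing matches are all assumptions made for illustration. The real library may recognise more patterns and behave differently on edge cases.

```javascript
// Illustrative sketch only; NOT the implementation used by talon or node-talon.
// Assumption: the signature starts at the last line made up solely of dashes.
// sketchExtractSignature is a hypothetical name, not part of the library API.
function sketchExtractSignature(message) {
  var lines = message.split("\n");

  // Scan from the bottom up. Index 0 is skipped so the body is never empty.
  for (var i = lines.length - 1; i > 0; i--) {
    if (/^\s*--+\s*$/.test(lines[i])) {
      return {
        text: lines.slice(0, i).join("\n"),
        signature: lines.slice(i).join("\n")
      };
    }
  }

  // No delimiter found: return the message untouched (null is an assumption).
  return { text: message, signature: null };
}

var result = sketchExtractSignature("Wow. Awesome!\n--\nBob Smith");
console.log(result);
```

For the sample message, the sketch splits at the "--" line, so result.text holds the body and result.signature holds the delimiter plus everything after it.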

Why the whole talon.signature.bruteforce.extractSignature?

The original library offers several ways of extracting signatures. Only one of them, the bruteforce method, is implemented here, but I chose to keep the same namespace structure.

Roadmap

What's working?

  • The bruteforce method of extracting signatures
  • French support

What's not working?

  • Everything else, including:
    • the machine-learning method of extracting signatures
    • quotation extraction

What's planned?

  • We'll need quotation handling very soon, so it should land quickly
  • The machine-learning method is really not a priority