Mathematical Language Identification

This is the repository for our Linearity final project at Olin College.

By analyzing the next-letter pairings of a large body of reference text and performing singular value decomposition on this data, we were able to identify the best match language of an arbitrary text.

Points of interest:

Report.pdf - our final report

Identification Results.pdf - the identification results from running a collection of sample text through our programs

LinearitySVDshort.mlx - MATLAB code used to guess the language of each text

LinearitySVDFinal.mlx - Same as -short, but makes graphs of reference data

SVDProjectTesting.mlx - Implementing SVD in MATLAB the longer way

adjacencyMatrix.py - Used to generate letter adjacency data from text files

Graphs - Graphs of the first & second singular vector plots for the reference data

Reference Texts - Reference text in each language

Unknown Texts - Sample files which we identified the lanuguage of

README.md - You are here

~ Jane Sieving & Sabrina Pereira, Spring 2018 ~

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Mathematical Language Identification

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Graphs		Graphs
Reference Texts		Reference Texts
Unknown Texts		Unknown Texts
Identification Results.pdf		Identification Results.pdf
LinearitySVDFinal.mlx		LinearitySVDFinal.mlx
LinearitySVDshort.mlx		LinearitySVDshort.mlx
README.md		README.md
Report.pdf		Report.pdf
SVDProjectTesting.mlx		SVDProjectTesting.mlx
adjacencyMatrix.py		adjacencyMatrix.py

jsieving/svd-languages

Folders and files

Latest commit

History

Repository files navigation

Mathematical Language Identification

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages