Skip to content

plrigaux/firstNames

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

First Names Creator

This is a program to create "original" first names.

Description

This program takes as input a list of first names from it generates new names sounding close to the ones in the input.

To be able to accomplish this, it uses Markov Chain taking to account the last two characters to determine the next one. The program also calculate the probability that a character sequence begins or ends the name. The probability table takes into account the frequency of the name then a character sequence coming from a frequent name will have more weight than one coming from less frequent names.

The output marks by appending asterisk character (*) at the name's end generated names found in the input (i.e. the program generated an existing name).

How to use

For usability reasons the program is separated between 2 scripts. The first one creates the 3D probability table. The second one generates the output based on the created table.

Configuration

You can easily configure some parameters to suits your needs. Here parameters that you can change:

  • The input source
  • The output file
  • Number of generated name limit
  • The minimal name length
  • The maximum name length
  • The Random Seed

The Data

The input data is the 2017 Quebec's baby boy names with frequency.

You can find the main site here.

Output

Here a sample output it generates:

  • LUCALE
  • DYLAUD
  • MILOIN
  • RAPHAN
  • RAYSON
  • JAMUEL
  • DAMUELIOT
  • NOLIAM
  • PAVYKO

Source

The inspiration of this program comes from "The machine inventing words" project.

You can find the source and description by following this link https://sciencetonnante.wordpress.com/2015/10/16/la-machine-a-inventer-des-mots-video/

About

An original first name creator program

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages