Harvesting terminology from a multilingual website
- Download the website with SiteSucker for OS X or with HTTrack for Windows.
- Determine the naming scheme for the source and target language HTML files.
- Use a file commander (Crax for OS X, TotalCommander for Windows) to place them in separate folders.
- If necessary, use a file commander with multi rename feature and regular expressions to simplify/align the file names.
- Load both folders in AlignFactory Light.
- Adjust the settings to your preferences.
- Create a TMX file.
- Open the TM in CafeTran.