Description:
Upload corrupt docx, xlsx, pptx, odt, ods or odt or files for text
extraction. Even if the corresponding Office 2007 or OO application
itself can't extract the file, this utility may work and it may be
possible to avoid retyping or re-entering the data.
Additionally, this service will work with non-corrupt files. The service
even extracts text
from doc and rtf files but is unlikely to be
successful with corrupt files for those extensions.
New, three alternative results returned for docx files, arrived at via different coding algorithms!
Description: this is a
GUI
version of the great
docx2txt
Perl script by
Sandeep Kumar.
It will extract text from damaged/corrupted Word 2007 files where Word 2007
fails.
Word 2007 files are actually zipped collections of
XML files and XML as a format is unforgiving of data corruption.
The main text in Word 2007 docx files is found in document.xml file in
the collection. Damaged docx2txt uses CakeCMD , an unzipper that
will unzip partially corrupt document.xml files. Also the Perl
routine used to extract the text from the document.xml file doesn't care
about well-formedness of the XML, a possible stumbling block of Word
2007.
The files in this package
include the command line version of the zip program
CakeCMD. Any
feedback is much appreciated.
Support requests can be sent to the feedback
E-mail.
If Damaged docx2txt fails, or you need
more formatting recovered, try
WordFIX...
Description: Corrupt xlsx2csv is a
new freeware GUI program for salvaging the data from corrupt Excel 2007
files. Xlsx Excel 2007 files are really zipped collections of XML
files. The main raw data is contained in the sharedStrings and
numbered worksheet XML files. XML is a very unforgiving medium
when it comes to data corruption, thus if the sharedStrings and or
worksheet XML files become corrupt, Excel has difficulty recovering the
unformatted data.
Corrupt xlsx2csv uses a command line unzipping
program that will unzip partially corrupt worksheet[#].xml and
sharedStrings.xml files. Also the Perl data extraction routines
don't use XML techniques that care about well formed XML, a stumbling
block for other Excel 2007 recovery programs.
If corruptxlsx2csv doesn't
work, or you need format recovery try
ExcelFIX...
Description:
Coded by Ccy, author of HaHa Zip and using Delphi Zip,
CMD Corrupt OfficeOpen2Txt will often recover text from corrupt Office
2007 docx, xlsx, and pptx format files where the respective Office 2007
or 2010 programs cannot make the basic salvaging of the text or data.
Office 2007 Office Open format
files are zipped collections of XML files. There are two kinds of
corruption of these types of files, zip structure corruption and
corruption of the XML files containing the actual text or data and/or
the formatting. The unzipping module used in Office 2007 and 2010,
appears to be more finicky than InfoZip module used by CMD Corrupt
OfficeOpen2Txt. Thus the underlying XML can often be extracted as raw
material for this new program even though this is not available to
Office 2007 and 2010 programs.
In regards to the other type of
corruption, XML is by design a very unforgiving medium for file damage.
From the errors returned from attempts at salvaging the text from
corrupt docx and pptx files as well as the data from xlsx file, Office
2007 and 2010 appear to be using a standard interpreter of XML. CMD
Corrupt OfficeOpen2Txt on the other hand uses coding that is more
tolerant of XML errors.
Description: this
Google Group contains 500+ link to freeware for recovering data lost
to file corruption, deletion, failing disk or lost passwords.
It contains many more links than
my data recovery freeware site, however, it goes into less
detail.
Description: This Excel file features
the ability to convert Excel data into the genealogy standard, GEDCOM format including the ability to specify
family relationships. GEDCOM generation is initiated with a push of a VBA
coded macro button.
Description:
This Microsoft Access database converts the
Catalogue of Life
into a gedcom format. This extends the use of the gedcom format
for use in Biological Systematics or species categorization. This
move is inspired by my belief that at least some if not most speciation
is produced by hybridization, not the accumulation of mutations no
matter how radically changing in body format one mutation can be.
I believe that especially
during the epochs of mass extinctions (for instance now), environmental
niches can become compressed and overlap where they previously had not.
This brings species into contact that would not normally be so.
Also the fact of reduced numbers, populations individuals will sometimes
choose mates outside their species. Most of these matings are
unsuccessful, but occasionally because of radiation exposure or
colchicine like biochemical induced, chromosome numbers or other
nuclear changes the offspring become fertile.
This means that the birth of a
species takes a mother and a father species with their children being
different from either parent as they are a mixture. A clear
example to me is Sea Squirts, the parent of all vertebrates. They
make cellulose (and chlorophyll?) and appear to be a cross between say
another filter feeder like a sponge, and a plant which has mobile sperm,
perhaps like a fern.
Description: this is the Google group version contains links,
files and reviews of famous family trees including for instance the Bible
family, Greek Gods, Royalty, Presidents etc. File will be in GEDCOM (GED),
PDF or other formats.
Description:
This AutoHotKey script derived application will run in the System Tray.
Holding down the shift key and tapping on the F3 one will cycle your
selected text through invert, lower, upper, title and sentence cases.
This program copies the similar
feature in MS Word that uses the same key combination. The feature in
Word omits the invert and sentence case but these can be accessed from
customizing the Quick Access Toolbar.
The script is mostly by None
with the sentence case function by Laszlo both from the AutoHotKey
forum. The script idea was suggested and after minor synthesis and
adjustment compiled by me socrtwo (Paul D Pruitt).
Description:
Based on the
Delphi-Zip library and
coded by
Ccy
for
S2 Services, No-Frills
Command Line Unzipper offers the advantage of not requiring command
switches. Commands are interpreted according to argument order. The
first argument is the zip file, the second is the unarchive folder and
the third and more are files to be unarchived.
If the second argument is extension-less, it is interpreted as a folder.
Also note if no extension-less argument is provided in the second
position the unzip folder is the current one. Several files to be
unzipped can be placed in the 3rd, 4th, 5th etc arguments with a space
as a separator (or 2nd, 3rd etc if no folder name is indicated).
NNo-frills has some additional advantages over for example zlib based
unzippers. It unzips Microsoft Office 2007 files without an added a zip
extension. It excels at unzipping corrupted zip and Microsoft Office
2007 files without the specification of additional options.
Note: for corrupt zip
files, misleading errors have yet to coded out. The successful
unzipping for instance of files failing the cyclic redundancy
check within the archive will occur despite the errors
indicated.
Description: SoftSearch is a search
engine tool which allows you to do bulk Web searches. It allows you to
upload a text file of up to 100 search terms or phrases separated by
carriage returns. It then returns 2-20 results for each term in html or
text. It creates an Access database to do this. You need Access
installed or maybe an Access runtime. Also requires comdlg32.ocx,
not included with the distribution.
Size: 104 KB zipped
OS: 2000, XP, Vista?
Screenshot: None available
Contact Info We are happy to answer support
E-mails at this time at
socrtwo@s2services.com