topbanner_forum
  *

avatar image

Welcome, Guest. Please login or register.
Did you miss your activation email?

Login with username, password and session length
  • Monday March 18, 2024, 10:50 pm
  • Proudly celebrating 15+ years online.
  • Donate now to become a lifetime supporting member of the site and get a non-expiring license key for all of our programs.
  • donate

Show Posts

This section allows you to view all posts made by this member. Note that you can only see posts made in areas you currently have access to.


Messages - Cocoa [ switch to compact view ]

Pages: [1]
1
@Dezoaan Thanks, I got the sfv working now.

This might be a bit confusing but where you save the SFV file is important in relation to which files are in the list:
  • Files that are on the same drive as the saved SFV file path are saved with paths relative to the save folder.
  • Files that are on a different drive than the saved SFV file path are saved with full paths.

Thank you for the reply, but as an average user, it IS rather confusing for me why the location where the sfv file is saved should matter. It's especially hard to understand just from reading the two sentences.  :huh: BtW, by "same drive" do you mean an entire physical hard drive itself or just a logical partition?

I've managed to accomplish the initial goal of comparing those two folders by saving the sfv file first inside the first folder. After some more time testing and using SFV Ninja, I do have some additional questions:
There are some cases where it's not possible to save the sfv file in the exact same location as the files/folders the checksums were being generated for. For example, when reading the contents of a networked drive or an optical disk/image. The original path might be read-only. Is it absolutely necessary to save the checksum in the exact same path? Is there another way?

Also, it would be helpful to be able to export the results of the checksum comparison report as a txt or html formatted file. I've tried to directly use Ctrl + C to copy the results and paste it into notepad, but it appears the contents are not possible to copy into clipboard.

2
I have two folders, say named Folder A and Folder B.
They're stored under different paths and have different names. Inside are other nested folders and files, which have the same general directory structure, but may or may not all match. What I'm trying to find out is exactly what's there and what's not, and for the ones that have same filename and patch, whether they're indeed the exact same files.

The fact that SFV Ninja stores relative paths instead of absolute paths lets you accomplish what you want:

  • Use SFV Ninja to create a checksum file for Folder A.
  • Save the resulting checksum file in Folder A.
  • Move the checksum file (or copy it) to Folder B.
  • Open the checksum file with SFV Ninja and have it verify the checksums.
That's what I thought would be the steps, except when I created the checksum file and moved it to the other folder, then selected "Load checksum file" -> "Verify All", all I got was the program re-verifying all the files in the initial folder that the checksum was created from. I also tried "Scan folder for checksum file and load" and got the same result.

Also would like to make a suggestion to Jody/skwire (not sure which you prefer), can you add a "clear selected" option? Sometimes someone might want to clear only a portion of the results or list before saving it to file, and being able to clear the exact ones a user needs instead of everything or everything of one type would be nice.

Additionally, thank you both for the software suggestions. I'm aware of WinMerge, though I haven't heard of Beyond Compare before. At a glance, Beyond Compare would seem to be perfect for my other need of maintaining and cross syncing backups of folders and files on multiple hard drives. I've been searching for something like that as well, so thank you! I'll definitely give it a try.

It's true that from the singular usage example I described, there are other types of software that can do what I need. But I really value efficiency in software and like to look for applications that can do as much of what I need as possible, with as little resource usage as possible. Since I also have a need for generating sfv files to send to other people for their verification purposes, I was mainly trying to find such a checksum generating software that could also let me compare the two folders in this way. It would seem to be a very simple and basic usage need that's quite standard for this type of application. I honestly did not anticipate the difficulty I would have in finding what I needed, but I'm very glad I came here.

3
Is it possible to add support for the file formats BPG and WebP? They've both been out for a good while now (though still not that popularized due to lack of software support), and the image comparison tool here shows a far superior image quality as well as file size ratio.

this off-topic, just to say that the difference there between BPG and JPG is very impressive
Yep. Too bad I still can't find a single image viewing/management software that supports BPG. Or at least one that is of decent functionality anyway. IrfanView has at least added support for WebP through a plugin, so I thought if I can't get BPG, at least WebP would be better than nothing. Both are better than JPG in many ways (WebP also supports lossless), though in terms of lossy, BPG is better than both by bounds and leaps.

4
Hey, mouser!

Long time no talk.  :) Hope you're doing well. You probably don't remember me though we exchanged several emails and messages with discussions back and forth a long time ago when I first joined; but then you likely talk to everyone.  :D

Came back to DC here (<- is that the generally accepted acronym?) to once again resolve some small software needs, and decided to pop in this thread as well with a feature request. Hope you don't mind ;)

Is it possible to add support for the file formats BPG and WebP? They've both been out for a good while now (though still not that popularized due to lack of software support), and the image comparison tool here shows a far superior image quality as well as file size ratio. I'm always about optimization and efficiency; not to mention for someone who deals with a lot of screenshots with a limited disk space, this is especially of interest.

5
Hello!

I've very glad I spotted your software fairly quick after coming to the forums. I spent the entire evening testing out around 7 or 8 different checksum generating or verifying software after trawling the internet, and in the end I had to come back to Donationcoder to find something even close to approaching what I need. I just wish I had thought of coming here sooner, but the forum is still a bit difficult to sift through to find a particular software for a particular need.

Your SFV Ninja's ability to generate checksum files with relative paths is a big bonus for me to decide to try it out--all the other tools glaring oversight to only be able to generate and verify checksums with static paths frustrated me to no end... (You'd think this was one of the most basic things that's needed for verifying checksum files )

Unforunately I still can't seem to figure out how to have SFV Ninja do all that I need, which is really quite simple:
I have two folders, say named Folder A and Folder B.
They're stored under different paths and have different names. Inside are other nested folders and files, which have the same general directory structure, but may or may not all match. What I'm trying to find out is exactly what's there and what's not, and for the ones that have same filename and path, whether they're indeed the exact same files.

So I need a file checksum tool that can do the following:

1. Scan and generate checksum files such as SFV or other formats for a folder, e.g. Folder A, with all subfolders and files inside.
2. Use the generated checksum file to compare against another folder, with different path and top level folder name, e.g. Folder B for the contents INSIDE.
3. Generate a report that tells me what matches and what's missing/changed. This doesn't have to be very comprehensive, but should be easy to read.

In the short amount of time I had left for the evening, I couldn't test SFV Ninja very thoroughly, but it seems it already got two out of the three down pretty well. Tihe only issue I have left is how can I accomplish the missing one (#2 in the steps)?

6
Thanks so much for providing the UTF8 conversion script and the latest version of the HTML2CGP tool. I tested both and here are the results.

HTML2CGP:

Like before, the conversion works perfectly for converting from cgp to html, but when converting back from html to cgp, it's actually not quite as straightforward due to the more complicated format of the html file.

There are several things that needs to be addressed:

1. Slimjet exports ALL bookmarks as a single html file, without the option to exclude, but I neither need, nor want the entire bookmark collection to be converted back into cgp groups wholesale. So I would really like an option to specify a folder name that the exporter should look for, for example "Export to GB", under which I will copy over all the bookmarks and their folders that I actually want to be converted into cgp. Additionally, the folders under this folder might have multiple levels (nested folders), and the export needs to be able to export only those that have urls (bookmarks) DIRECTLY under them, while retaining the folder name as the cgp group file name. (Please see screenshot for clarification on what the format might be like.)

2. Since the folder names are to be converted into cgp group file names, some folder names may contain characters that are illegal for Windows file names. If the converter runs into any of them, it would be immensely helpful to have it auto-replace the illegal characters with an underscore "_"

[attachimg=#1][/attachimg]
[attachurl=#3][/attachurl] (html format of the above example)

X2UTF8:

I hate to bother you about this in addition to the converter, but I tested the script with different files, and it seems it's not accepting the encoding names. First I tried Shift-JIS, and then I switched to a cgp file, using the "gb2312" you gave as an example, but I kept getting the same error as below:
[attachimg=#2][/attachimg]

7
(^ ^!) I think with about a hundred lines flashing past in the span of a few seconds for all the cgps I had to convert, I would need to have super humanly fast reading speed, rather than slower, to catch that 1 line at the beginning. I did check the the last few lines very carefully to look for some info on the final conversion results, as that's where the info usually is, but unfortunately didn't expect that to be at the very beginning or I would have tried to go back and see it.

I've tested importing and all has gone very smoothly this time. As far as I can tell, the nearly a thousand urls and their names in all the groups are accounted for and any Japanese or Chinese characters have also been transferred intact.  Even the non-Romance file names of the cgp groups have been handled just fine. ;D

Thank you so much, 4wd for all your work and unwavering support that finally created this tool. Being able to retain all the urls and info collected over the years is invaluable and I have thoroughly enjoyed the process of our tackling the challenges together.  :D

In case you or anyone else feels like adding additional features in the future, such as reverse converting back from html to cgp, I'll still more than welcome them.

BTW, I'm curious about the workings of the program you found that could perform encoding conversions. Could you provide a link to the original?

8
Hey, 4wd

I had both the script and the shortcut in the same folder and you had said I didn't need to change anything if I did that. After I followed your instructions in the latest post and amended the target path in the shortcut properties to

%SystemRoot%\system32\WindowsPowerShell\v1.0\powershell.exe -noprofile -executionpolicy bypass -File "C:\Temp\CGP2HTML\v3\CGP2HTML.ps1"
(Which is where I stored both the script and the shortcut)

I got the same result as before, a white blinking cursor in a powershell window with nothing else.

I decided to try updating Powershell to 4.0, the version that was in your system, Windows 8.1. This time, the cursor window also brought up a box to choose a folder, similar to your screenshot.

I pointed it to the folder where the cgp files were stored, the conversion happened, then a prompt said "conversion finished, press any key to exit".

I did that.

The problem now is... I can't find the html file that was supposed to be generated after that. It's not in the cgp folder, or in the folder where the script was. I ran it a second time with the updated files in your previous post, same result. I tried using Windows Search to do a full hard drive search for the file, but since I turned off indexing, don't know the file name, and have no other more advanced file search tool, it's likely trying a needle in a haystack. Where is the html file??

9
Try adding the -noprofile -executionpolicy bypass switches to the command line in the shortcut.
Tried it, but this time all it shows is a blinking white input cursor  :huh:


BTW, you did amend any path in the shortcut to point to wherever you've put the script, right?
Which path exactly do you mean? If you mean the "start in" box for the shortcut, I removed the old path. Based on previous experience with shortcuts in general, I didn't fill in any new ones because I didn't think it was necessary, but now I have updated it. As with the previous result, running the shortcut gets the same white blinking cursor and nothing else.

After further consideration, I've also tried replacing the -File with -"C:\Program Files (x86)\Green Browser\Groups", which is where the cgp files are stored, but this time I once again got a window with red error messages that disappeared too fast for me to read or screenshot. I tried running it repeatedly after that to capture a screenshot, but it seems all I'm getting now are blank windows that last less than a second, without the error messages anymore.

10
Hi 4wd,
I'm getting a window with errors when double-clicking the shortcut. It was gone too fast, but after several tries I managed to capture this scrnsht. After some research, it seems this is due to the fact the default security settings in Windows does not allow the execution of unsigned scripts. Is there a way you can sign it? Or do I need to permanently set the security level to "unsigned" in order to run the script? I hope there is a more secure workaround to avoid opening a permanent security risk for all unsigned scripts on my system.

error scrnsht.gif

I found this while searching for a solution. Perhaps you can make use of it, or help me understand how to use it?

On my machine that I use to dev scripts, I will use -unrestricted as above. When deploying my scripts however, to an end user machine, I will just call powershell with the -executionpolicy switch:
powershell.exe -noprofile -executionpolicy bypass -file .\script.ps1

11
 :-[ Sorry, besides the fact I switch between different system locales depending on what I need, I didn't think it was necessary to know the initial encoding in order to convert from ANSI to Unicode. I apologize for any unnecessary detours caused.  :P

I'm glad you found something to simplify the encoding conversion process. :) I think ultimately, all data is stored in binary format, so while it may appear to be gibberish, it's not actually gibberish, as long as it hasn't been corrupted. As I understand it, that data just needs the proper key to be "translated" into human readable form, which in this case, is what the character encodings are for. That's why the characters in the cgp files are able to be consistently displayed in Green Browser on my computer, as well as show up correctly in Slimjet, once the proper encoding was selected. (They might not show up properly on your system until you have the proper encoding installed, though I'm pretty sure most post-WinXP OS's already have most encoding installed by default). It's only when the data gets translated with the wrong encoding and then stored in the "translated" form, then they will get corrupted and likely lose portions, if not all, original info, even if translated correctly again. After all, some things are "lost in translation", especially incorrect translation. That's when gibberish will truly stay gibberish.

Please take your time and try things out to see what works. I'll look forward to seeing the result and testing things again.

12
You're talking about having a program read each URL, fetch that page, parse it, and then store the page title plus the URL in another bookmark format and character encoding.
...
Exactly, it was the gibberish that is the name field - there is no way around that, it starts out as gibberish, it'll remain gibberish.  The only option is to refetch the page and save the title with the correct encoding.

I've been playing around in Powershell and quite frankly, I can't get it to work consistently with some of the sites in your .cgp file.
At first I was very confused about the first sentence, since from the beginning, I was talking about just converting text from the cgp files into text stored in the html format, but now I understand where you're coming from. The thing is, fetching titles from the sites themselves wouldn't work, and not just because you have no control over the encoding any particular site chooses to use (yes, some use Unicode, but many will use a specific encoding which varies depending on the language). Most importantly, because my original "names" for the urls are more than just the titles for the pages. I used it more for notes than anything else, summarizing the site's content in a way that makes sense to me, and often include important reminders of other things. That's why it's crucial to retain the original "name" fields.

I truly appreciate your patience and varied approach in finding a solution to this challenge. ;D

I thought it would be possible to convert directly from ANSI to Unicode, but as someone without any technical background in computers, I don't really know how the process works, so I guess I over simplified things. You said if it starts out as gibberish, it would remain gibberish. Thinking back on the some of the little built-in tools I've occasionally used from other software to fix mojibake (such as this Chacon plugin for foobar2000), they have generally always required you to choose a character encoding for the source text, then an output encoding.

In Chacon, which is used to fix tags for audio files, it provides a preview window to allow you to switch between each encoding and see immediately if the characters are displayed correctly, i.e. if the correct source character encoding is chosen, prior to conversion and final overwrite to the tags themselves. It allows non-Unicode characters that was generated on one computer to be displayed correctly on another computer with a different system locale. The preview is useful due to the fact that not only do you have no idea what system locale was used on the computer from which the file originated, even when you know the actual language the characters are suppose to be in, the encoding itself doesn't always match the actual language of the characters used. With a large enough characterset, it's possible to display nearly all other characters from other languages in the same characterset intended for another language. e.g. It's possible to display Japanese characters using Chinese encoding, and maybe vice versa. (It's also why most European languages only have one "Western" charset.) However, sometimes there are some characters not included, and another important function of the preview is to allow you to see what might be lost if you chose a certain encoding, and which encoding might retain the most info, etc. I didn't think it would've been necessary to do it this way for converting from ANSI into Unicode, but I overlooked the fact that Unicode is ultimately still just another encoding, albeit one that is able to include all languages.
Chacon scrnsht.PNG

I will understand if you consider this too involved to implement. There is always the option to set it aside. I can try manually editing the "UTF-8" field for the right encoding. In your first version of the tool CGP2HTML, the html generated was able to be displayed correctly after I edited the "UTF-8" field to the correct encoding that I assume the text was originally saved in. If you just obtain the text without attempting to convert (or however it was done in the first version),  I can manually edit this field into what I think might be the proper encoding, based on the non-Unicode settings of the system it was generated on.

Case in point, on my system, the "system locale" or what Windows calls "the language setting for non-Unicode programs" varies, changing from time to time, depending on what language I need to display. It is often set to "Chinese (Simplified, PRC)", and sometimes "Japanese (Japan)". As I said, there are many different code pages even just for Simplified Chinese, and I'm not sure exactly what that translates to code page wise, but when I open a cgp file directly in the Green Browser window as a "webpage", "GB2312" is the encoding it selects automatically, and appears to display correctly. When I try to open the cgp in Slimjet, I have to manually select an encoding to have it display properly, and since GB2312 wasn't available, I chose "GBK", which is apparently just an extended version of the GB2312 charset. It also appears to display properly. In the html generated by the first version of CGP2HTML, all I did was edit UTF-8 to GBK and it displayed correctly when imported into Slimjet.

13
Hey, 4wd

Thanks so much for your continued efforts in helping and supporting this app. I got caught up with some unexpected stuff over the weekend, didn't get a chance to test the new version until now. I double-clicked the script to run it and ran into an error after choosing the folder for groups. It seems many of the commands are not supported in my Windows 7 SP1 x64 (Ult). Is there some extra dependency?

[attachimg=#1][/attachimg][attachimg=#2][/attachimg]

And to answer your earlier question about encoding. The characters in my first post are UTF-8 because they are copied and displayed in a webpage. When displayed in the web browser that you're currently viewing, they will display according to the html and/or browser settings. Chrome and many modern browsers will automatically select the correct code page for displaying text that's not in Unicode (assuming you have the code page installed), but you can also set your own display encoding manually in your browser, and that can also result in mojibake when it doesn't match the actual encoding that's being used. Since there can be multiple code pages even for the same language (due to regional differences), it makes a fairly troublesome affair, which is why most use Unicode nowadays. Anyway, what you see in a webpage is entirely separate from the cgp files, which is probably still in ANSI, the encoding that Green Browser uses by default for generating them (and I have no control over). It's also why the mismatch in encoding causes mojibake when the text from cgp gets imported into Chrome/Slimjet.

Took a closer look and it seems generation for the html was actually in the process of working when that error caused it to fail. I have here the html file generated that contains the contents of 1 full group, as well as partial contents from a second group. I've also uploaded the original cgp files of those two groups. You said the archive/extract process might be messing with the encoding. I know this is possible for the file names, but unlikely for the contents of a file. However, just to eliminate all possibilities, I've used an outside hosting site to upload the original files without compression, since the attachment format restrictions of this forum doesn't allow cgp or html.
(*links removed since they're no longer needed)

1. Unfortunately encoding conversion still isn't working right. The non-Romance characters still end up as gibberish when imported. Since the mojibake ends up with even more unusual symbols than normal being displayed when using a wrong encoding, perhaps it was one such character/symbol that caused the app to generate an error and exit? This appears to be something strangely similar to what happened with the convertor for cgp to IE favorites.

2. The unique top level "folder" name needs to be <H3> tag, not <H4>, or it will not be properly imported into Chrome/Slimjet. Any and all following folders after that also needs to be <H3> tag, preceded by <DT> and the contents enclosed by a pair of
Code: Text [Select]
  1. <DL><p> tags
The importing process doesn't recognize anything else. The only things denoting the boundaries of each folder are the
Code: Text [Select]
  1. <DL><p> and </DL><p> tag pairs
Everything following the DL tag and before /DL tag, regardless of whether it's a <H3> folder or an url, will mean that it's inside that particular "folder". So if you want to create different layers of folders, you only need to make sure the pairs are placed appropriately, which means the "folder" names always need to be <H3> tags and MUST be followed immediately by an opening DL, p tags. Current html is also missing the DL, p tags after the top level folder. The basic code I've included in my second post are what I have tested to work, so it should work as long as the format in the generated html closely follows that.

[attachimg=#3][/attachimg]

3. If it's not too much trouble, it's probably best to have the file name of the html generated include date as well as time. Otherwise there will still be a small but distinct possibility that it will conflict with an already generated file name. e.g. instead of "CGP2HTML-%hour%%min%%sec%.html", how about "CGP2HTML-%year%%month%%day%.%hour%%min%%sec%.html"?

4. One additional issue that just occurred to me when looking over my cgp files. The file names of some cgp files themselves can be in non-Romance characters. Let's consider together how we should handle this once the other issues have been resolved.

14
Added in additional notes in my original second post regarding the top level "folder" name, etc. Also updated the annotations and code to clarify things a bit more.

15
This is something quick and dirty until someone comes up with something better.  It pays no attention to strange character encodings so the Japanese text in your sample will be output as basically garbage but the HTML generated will still work.

CGP2HTML.cmd
.....

Hi, 4wd

Thank you so much for your prompt response. I honestly did not expect a reply so fast, so I didn't see the response until really late and didn't have time to work on testing until this evening.

I've tried the cmd converter. It generated a html file for the cgp I tested just fine, but unfortunately the importing afterwards didn't work. I tried manually comparing the html with an original html generated by Slimjet, and did some editing... After several hours of mounting frustrations and pointless searches on the internet, I gave myself a facepalm :D and suddenly realized what was missing. (annotated code at bottom)

Then about the number of files processed. I have several dozens of cgp files that needs to be converted. Is it possible to make the converter mass process multiple cgp files (such as all under current folder) into one html file? The <H3> tags in the html file represent "folder" names for the bookmarks under Slimjet/Chrome. Each cgp file's name can be used to create an additional <H3> folder tag of the same level, and thus multiple cgps can be incorporated into one html file. Everything also needs to be given a unique top level <H3> "folder" name, such as the date&time generated, so they will be imported under 1 folder and avoid mixing in with pre-exisiting bookmarks in Slimjet/Chrome when imported. (There can be lower level "folder" names within "folder" names, but their tag remains the same (H3), so the hierarchy and boundaries of each folder is probably determined by each corresponding opening and closing
<DL><p>
tag pairs.)

Lastly, there is the problem with the charset encoding. I'm sorry I wasn't clearer in my original post, but the non-Romance language texts needs to be preserved. The only things that are not needed are some of the extra symbols such as the interpunct dot ( ・ (U+30FB) or  ・ (U+FF65)) that are not included in the ANSI charset. The symbols are not stored properly under ANSI, resulting in their appearance as a "?" in the cgp files. Since they are only some minor symbols, it's not necessary to retain them during the conversion. The words in the non-Romance languages such as Japanese still need to be preserved however, as they retain important info for the labels of the url. Right now the conversion will result in mojibake (garbled nonsense due to incorrect character encoding). When I view source and open the generated html as text, I can view the characters just fine, but once imported into Slimjet, the browser automatically selects the encoding, and it isn't the right one, resulting in mojibake.

I may have discerned a simpler solution that will deal with both the mojibake and the extra symbols outside ANSI charset. The problem lies with choosing the right character encoding for the imported text. The original meta tags header in a Slimjet generated html file will let you set the encoding, which, in this case isn't UTF-8. However, this is a very inelegant solution that requires manually setting the encoding each time, depending on what the encoding of the imported cgp text is. A much better way to fix everything is to have the convertor first convert the text obtained into UTF-8 encoding. Then process everything into the html formatting. That way, in theory, regardless of the encoding used for the cgp files, everything should display properly, even the extra symbols.

Annotated Code fin.PNG

Code here:
Code: Text [Select]
  1. <!DOCTYPE NETSCAPE-Bookmark-file-1>
  2. <META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=UTF-8">
  3. <TITLE>Bookmarks</TITLE>
  4. <H1>Bookmarks</H1>
  5. <DL><p>
  6.     <DT><H3>(Top level unique folder name. Use date&time?)</H3>
  7.     <DL><p>
  8.         <DT><H3>(1st cgp group file name)</H3>
  9.         <DL><p>
  10.             <DT><A HREF="(url)">(name)</A>
  11.         </DL><p>
  12.         <DT><H3>(2nd cgp group file name)</H3>
  13.         <DL><p>
  14.             <DT><A HREF="(url)">(name)</A>
  15.         </DL><p>
  16. </DL><p>

16
Hello, all. I'm new here. Found the site while searching for a specific program and registered. I recently ran into this small problem, and as an average computer user who has absolutely no coding background, I'm at a total loss despite the simplicity of the issue. After many hours of frustration while trying various things with my limited knowledge, I decided to try posting here in the hopes the community can give me a hand.

I use Green Browser, which is a more of a Trident/IE based browser frontend. It can create these "groups", which are basically a more organized collection of favorites/bookmarks. I would like to have a small program to convert/export these groups into a html file such as those used by Chrome/Chromium to import as bookmarks. (More specifically, I use another custom Blink/Chromium browser called Slimjet, and would like to import to there, but I believe the format is probably universal for all Chrome/Chromium variants.)

Each Green Browser group is a bunch of urls, with each url given a name/label, and saved in a customized order. The info is stored as what I believe to be ANSI encoded text, with each group saved as an individual Windows file, the base file name being the group name and with the "cgp" custom extension. Here's an example (see attachment for original file):

File name: "Example Group.cgp"
Code: Text [Select]
  1. [Group]
  2. name0=BBC - Homepage
  3. url0=http://www.bbc.com/
  4. name1=Yahoo
  5. url1=https://www.yahoo.com/
  6. name2=Google
  7. url2=https://www.google.com/?gws_rd=ssl
  8. name3=CNN.co.jp : NASAの「空飛ぶ円盤」、ハワイで飛行実験
  9. url3=http://www.cnn.co.jp/fringe/35065380.html?tag=rcol;editorSelect

As for the Chrome bookmarks html file, I've also included a sample export. (Please see attached file for reference, the contents are too long to post here.) I don't know much about html formatting, but after analyzing the content, I believe, the main thing that's required to have Slimjet (Chrome) recognize the urls and their names, is to have each be reformatted from the above into such a format:

Code: Text [Select]
  1. <DT><A HREF="(url)">(name)</A>

The key things are to preserve each url, its name/label, and of course the order in which the urls are stored in for each group. I think you can safely ignore and exclude the other tags, such as "add date"; and of course the final file still needs the proper titles/meta tags to designate it as a html, but I'm not sure what exactly is required, as I'm don't know enough about html formatting.

Additionally, each group name can be stored as a folder name for the bookmarks, which would be at a different level than the individual urls in the html formatting, and lastly everything should be stored under a unique top level folder name, perhaps based on the date/time generated, or allow user input, so as to avoid overwriting or mixing in with existing bookmarks when importing.

Finally, a crucial thing to note is that although the groups are stored in *what I believe* to be ANSI encoding (someone can examine the file and correct me if I'm wrong), they actually can contain any number of characters, some of which are not included in ANSI, such as this  ・ (U+30FB) or  ・ (U+FF65). (If you cannot correctly display the symbol, see the wikipedia link here and jump to the Japanese section.) They are sometimes stored as just a "?" in a Green Browser group file due to the limits of the ANSI encoding, and there can be quite a few of them. The program needs to be able to handle the symbols, but does not need to preserve or restore the original symbols. It merely needs to be able to at least put a space in the place of these, or remove them (ideally, an program setting option to decide which would be best). I have on hand a small program that's suppose to be able to convert from Green Browser groups into IE favorites, and it would generate errors and fail at the conversion randomly, likely due to these characters. I've attached it here as well if someone wants a reference point. The program I'm requesting to convert to Chrome bookmarks html needs to be able to handle these symbols without failing in the same way.

Once again, the main goal is I really need something to help me convert a large number of Green Browser groups files into an html file that can be imported for Chrome (Slimjet) bookmarks. If anyone can help me out with this, then after that, perhaps other things can be added, such as support for converting back from bookmarks into groups, and perhaps adding support for other browsers that have groups with similar formatting, IE favorites, etc. Your help would be tremendously appreciated.

Pages: [1]