--- Log opened Fri Jul 27 00:00:20 2007 |
01:05 | < ToxicFrog> | AhahahahahahaHAHAHAHAHAHAHAHA VICTORY IS MINE |
01:19 | | ReivSLEP is now known as ReivClass |
01:20 | <@Vornicus> | oh? |
01:22 | < ToxicFrog> | In a manner that involves piping curl through awk, egrep, uniq, a code block, sort, another code block that filters through egrep and sed, and yet a third code block. |
01:23 | < ToxicFrog> | Consider, if you will, a site that hosts fan-translations of manga series. |
01:24 | <@Vornicus> | Oh no you didn't. |
01:24 | < ToxicFrog> | But, shock horror, each manga is displayed one page at a time - no browser-crushing single HTML page, no easily downloadable CB* archives. |
01:24 | < ToxicFrog> | I totally did. |
01:24 | <@Vornicus> | You, sir |
01:24 | <@Vornicus> | Are the bomb. |
01:25 | <@Vornicus> | Which one is this? |
01:25 | < ToxicFrog> | Depending on the arguments given, it will do one of: download a complete manga series; generate one named HTML page per chapter; generate one named CBT archive per chapter (and clean up the raw images afterwards); or unpack the CBT archives back into the raw images. |
01:26 | <@Vornicus> | I never liked that about bzip, that it would delete itself when you were done. |
01:26 | < ToxicFrog> | It's a one-line change. |
01:26 | <@Vornicus> | I know. |
01:26 | < ToxicFrog> | And the site is onemanga.com |
01:26 | < ToxicFrog> | (I mean, in this script) |
01:27 | < ToxicFrog> | (which is easier to edit than bzip2) |
01:28 | < ToxicFrog> | Anyways. I may add a command line option to toggle that. |
01:28 | < ToxicFrog> | But the core is done. |
01:29 | <@Vornicus> | oh |
01:29 | < ToxicFrog> | I am currently testing it by downloading all 175 extant chapters of Yakitate!! Japan >.> |
01:29 | <@Vornicus> | hee |
01:43 | < ToxicFrog> | whoops |
01:43 | <@Vornicus> | ? |
01:43 | | * ToxicFrog makes a typo and hoses his offline copy of Claymore |
01:43 | <@Vornicus> | hose how? |
01:44 | < ToxicFrog> | hose rm -rvf. |
01:44 | <@Vornicus> | oh, cool. |
01:45 | < ToxicFrog> | Fortunately, this is easy to fix! |
01:45 | < ToxicFrog> | $ omdl Claymore html pack |
01:45 | <@Vornicus> | rock |
01:45 | < ToxicFrog> | Takes a while, but then, I've already read it. |
01:46 | < ToxicFrog> | And I think, although I have yet to test this, that you can download individual issues as a side effect of the way it constructs the URL. |
01:46 | < ToxicFrog> | Hmm, no you can't, the directory name generator gets confused. |
01:46 | | * ToxicFrog fixes this |
02:00 | < ToxicFrog> | Hmm. |
02:00 | < ToxicFrog> | Packing '001 - The Silver-eyed Killer'...tar: Cowardly refusing to create an empty archive |
02:00 | < ToxicFrog> | Try `tar --help' or `tar --usage' for more information. |
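GNU tar prints "Cowardly refusing to create an empty archive" when it is asked to create an archive without any file operands, which is what happens if the glob of chapter images matches nothing. A minimal sketch of a guard follows. The helper name `pack_chapter` and the one-directory-per-chapter layout are assumptions for illustration, not taken from omdl.

```shell
# Hypothetical helper, not part of omdl: pack one chapter directory into a
# .cbt file (a plain tar archive), refusing early if the directory is empty.
#   $1 = chapter directory, $2 = archive to create
pack_chapter() {
    # ls -A prints nothing for an empty (or missing) directory.
    if [ -z "$(ls -A "$1" 2>/dev/null)" ]; then
        echo "Skipping '$1': nothing to pack" >&2
        return 1
    fi
    # Archive the directory contents relative to the directory itself.
    tar -cf "$2" -C "$1" .
}
```

Checking before calling tar turns a confusing mid-run tar error into a clear message that names the chapter that came back empty.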
02:01 | | gnolam [lenin@Nightstar-13557.8.5.253.se.wasadata.net] has quit [Quit: Z¨?] |
02:07 | < ToxicFrog> | ...what the shit |
02:07 | < ToxicFrog> | The page format appears to have changed while the script is running |
02:07 | < ToxicFrog> | ;.; |
02:07 | < ToxicFrog> | Oh, wait, no, it's a bug in my script~ |
02:07 | <@Vornicus> | heh |
02:07 | < ToxicFrog> | (and a bug in their webserver, but) |
02:08 | <@Vornicus> | heh again |
02:09 | < ToxicFrog> | Ok. Claymore and Yakitate!! Japan both happily downloading. |
02:09 | < ToxicFrog> | (those !! were great for finding quoting bugs, too~) |
02:12 | < ToxicFrog> | I never thought a manga about bread making could be so gripping. |
02:14 | <@Vornicus> | what, yakitate!! Japan is about bread making? |
02:14 | < ToxicFrog> | Yes. |
02:14 | <@Vornicus> | I should read this. |
02:14 | < ToxicFrog> | A young man who looks at french bread, and italian bread, and so forth, and bemoans the fact that there is no bread that Japan is known for. |
02:15 | < ToxicFrog> | And thus, he goes forth into the world! |
02:15 | < ToxicFrog> | To create a bread that will become synonymous with the name of Japan! |
02:15 | <@Vornicus> | Can I have your script of mass evil? |
02:16 | <@Vornicus> | what's the requirements - just the text tools? |
02:16 | < ToxicFrog> | Syntax: omdl <manga name>[/<chapter>] operations... |
02:17 | <@Vornicus> | okay, what's the official name of Yakitate!! Japan for this? |
02:17 | < ToxicFrog> | Ummm...egrep wget find tar curl sort uniq awk |
02:17 | < ToxicFrog> | 'Yakitate!! Japan' |
02:17 | < ToxicFrog> | Be sure to use singlequotes, or it will explode. |
02:17 | < ToxicFrog> | Operations are: download pack unpack html |
02:18 | < ToxicFrog> | So, omdl foo download will download all of foo to foo/raw |
02:18 | <@Vornicus> | okay, so I want, what, download html? |
02:18 | <@Vornicus> | to get HTML pages? |
02:18 | < ToxicFrog> | omdl foo pack will pack foo/raw into foo/cbt/*.cbt (and leave foo/raw untouched) |
02:18 | < ToxicFrog> | Yes. |
02:18 | <@Vornicus> | ok |
02:18 | < ToxicFrog> | Download html will download the images into foo/raw, and then generate a bunch of HTML pages in foo/html for reading them issue by issue. |
02:19 | < ToxicFrog> | These can have upwards of 100 images per page, so a robust browser is recommended. |
02:19 | <@Vornicus> | [Gardner:~/Downloads] vorn% omdl 'Yakitate!! Japan' download html |
02:19 | <@Vornicus> | omdl 'Yakitatewhich curl Japan' download html |
02:19 | <@Vornicus> | that didn't work. |
02:19 | < ToxicFrog> | ...what shell are you using? |
02:20 | < ToxicFrog> | '' works in bash. |
02:20 | <@Vornicus> | tcsh. |
02:20 | < ToxicFrog> | Aah. |
02:20 | < ToxicFrog> | Use whatever you need to get tcsh to not expand !!, then. |
02:20 | <@Vornicus> | Mainly because that was the default in 10.2, and I haven't bothered finding the place to change that. |
02:21 | | * Vornicus just goes "bash" |
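The failure above comes from history expansion: tcsh expands `!!` (the previous command, here `which curl`) even inside single quotes, while bash leaves it alone there. A short sketch of the difference, written for a Bourne-style shell with the tcsh form shown only as a comment:

```shell
# In bash and other Bourne-style shells, single quotes keep "!!" literal.
title='Yakitate!! Japan'
printf '%s\n' "$title"

# tcsh performs history substitution even inside single quotes, so each "!"
# needs a backslash there (left as a comment because this block is sh):
#   omdl 'Yakitate\!\! Japan' download html
```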
02:21 | < ToxicFrog> | Heh. |
02:22 | < ToxicFrog> | I note that it persists, so 'download' followed by 'html' has the same effect as 'download html' |
02:22 | <@Vornicus> | awesome |
02:23 | <@Vornicus> | 21:30:06 URL:http://www.onemanga.com/Yakitate!!_Japan/93/ [46632] -> "www.onemanga.com/Yakitate!!_Japan/93/index.html" [1] <--- what's this doing? |
02:23 | < ToxicFrog> | Downloading it. |
02:23 | < ToxicFrog> | Saving http://www.onemanga.com/Yakitate!!_Japan/93/ to ./www.onemanga.com/Yakitate!!_Japan/93/index.html |
02:24 | <@Vornicus> | oh, ok |
02:24 | <@Vornicus> | what's the numbers? |
02:24 | < ToxicFrog> | The numbers in []? |
02:24 | <@Vornicus> | yes |
02:24 | < ToxicFrog> | File size, I believe. |
02:24 | < ToxicFrog> | Well, the first one |
02:24 | < ToxicFrog> | The second one is clobber index, I think, and is always 1 for normal operation. |
02:25 | <@Vornicus> | Ah, this I see shall take a while. |
02:26 | < ToxicFrog> | 175 chapters, 30-40 pages per. |
02:26 | < ToxicFrog> | It will take quite some time. |
02:26 | < ToxicFrog> | If you want a quick preview, try: omdl 'Yakitate!! Japan/1' download html |
02:26 | < ToxicFrog> | Which will download only the first chapter. |
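The `<manga name>[/<chapter>]` argument and the URLs quoted earlier in the log suggest how a name might map to a chapter URL. The sketch below is a guess at that mapping, not omdl's actual code. The variable names are hypothetical, and the space-to-underscore rule is inferred only from the `Yakitate!!_Japan` URLs shown in the log.

```shell
# Hypothetical parsing of an argument shaped like "<manga name>[/<chapter>]".
arg='Yakitate!! Japan/1'

name=${arg%%/*}                 # text before the first slash
chapter=
case $arg in
    */*) chapter=${arg#*/} ;;   # text after the first slash, if present
esac

# The URLs in the log use underscores where the title has spaces.
slug=$(printf '%s' "$name" | tr ' ' '_')
url="http://www.onemanga.com/$slug/${chapter:+$chapter/}"
printf '%s\n' "$url"
```

Leaving `chapter` empty when no slash is given makes the same expression produce the series URL, which would fit the way one argument form serves both whole-series and single-chapter downloads.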
02:27 | <@Vornicus> | a good idea |
02:28 | | Chalcedon is now known as ChalcyCleaning |
02:29 | < ToxicFrog> | Oh dear, I appear to have exploded Opera. |
02:29 | <@Vornicus> | gah. you're using flags in mv and sed that mine apparently doesn't like. |
02:30 | <@Vornicus> | (-t and -r respectively) |
02:30 | < ToxicFrog> | -r in sed is non-negotiable, it enables EREs. |
02:30 | <@Vornicus> | mine says -E |
02:30 | < ToxicFrog> | Well, rather, it's negotiable if you're willing to change all the sed calls to use BREs. |
02:30 | | * Vornicus changes |
02:31 | < ToxicFrog> | Aah. |
02:31 | < ToxicFrog> | Mine doesn't have that. |
02:31 | < ToxicFrog> | (4.1.5) |
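The mismatch here is that GNU sed spells extended regular expressions `-r`, while the BSD sed shipped with Mac OS X spells it `-E`. One way a script could cope is to probe for the flag once and reuse it. This is a sketch, not what omdl does. The variable name `SED_ERE` and the sample filename are made up for illustration.

```shell
# Pick whichever extended-regex flag the local sed accepts.
if printf 'a\n' | sed -E 's/(a)/\1/' >/dev/null 2>&1; then
    SED_ERE=-E
else
    SED_ERE=-r
fi

# Example use with an ERE group: strip a leading page number and dash.
printf '007-cover.jpg\n' | sed "$SED_ERE" 's/^([0-9]+)-(.*)$/\2/'
```

The alternative mentioned in the log, rewriting every expression as a BRE, avoids the probe entirely at the cost of backslash-heavy patterns.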
02:31 | < ToxicFrog> | And -t to mv lets you go: mv -t dest source source source ... |
02:32 | < ToxicFrog> | You can replace that with: mv img.onemanga.com/mangas/*/* raw/ |
02:32 | < ToxicFrog> | Assuming that your mv supports that. |
02:33 | | * ToxicFrog puts Opera out of its misery, listens to the THWONK of 400MB of memory being freed. |
02:33 | <@Vornicus> | mv [-f | -i | -n] [-v] source ... directory |
02:33 | < ToxicFrog> | Yep, that should work then |
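`mv -t` is a GNU coreutils option that names the destination first. The usage line pasted from the Mac's mv only offers the traditional form with the directory last. A small sketch of the portable form, where the wrapper name `move_into` is hypothetical:

```shell
# GNU-only form, destination first:
#   mv -t raw/ img.onemanga.com/mangas/*/*
# Portable form accepted by both GNU and BSD mv: sources first, directory last.
#   $1 = destination directory, remaining arguments = files to move
move_into() {
    dest=$1
    shift
    mv -- "$@" "$dest"/
}
```

For example, `move_into raw img.onemanga.com/mangas/*/*` has the same effect as the replacement command suggested in the log.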
02:33 | <@Vornicus> | ok |
02:35 | < ToxicFrog> | (I love the way how, as soon as I said "consider a site that...", you said "Oh no you didn't") |
02:35 | <@Vornicus> | hee |
02:36 | < ToxicFrog> | Although this is only, what, my third site ripper? |
02:37 | <@Vornicus> | some such silliness. |
02:37 | < ToxicFrog> | Galbadia Hotel, Narbonic...I think that's it. |
02:37 | < ToxicFrog> | Stuff that only invokes wget doesn't count~ |
02:40 | < ToxicFrog> | Anyways. Once my tests are done, I will post this to l. |
02:40 | < ToxicFrog> | *lj. |
02:40 | <@Vornicus> | Okay, that run seems to have worked just fine. |
02:41 | <@Vornicus> | ...you know, this would be easier to read if the scanslator flipped the images so that the sequence is left-to-right. |
02:42 | < ToxicFrog> | Depending on the manga, that can break things. |
02:42 | < ToxicFrog> | And it's not that hard to adapt to. |
02:42 | <@Vornicus> | Yeah, I know. |
02:42 | < ToxicFrog> | The main problem I have is adapting back when I finish. |
02:42 | < ToxicFrog> | I find myself reading my webcomics right to left. |
02:43 | <@Vornicus> | That must suck when there's a punchline. |
02:43 | < ToxicFrog> | Yep. |
02:45 | < ToxicFrog> | Less so with stuff like GG, which has multiple rows. |
02:46 | < ToxicFrog> | Since I generally catch myself partway down and relex. |
02:46 | <@Vornicus> | ...hee. "THIS IS" *speedlines* "WHAT REAL BREAD" *more speedlines* "TASTES LIKE" *still more speedlines* |
02:50 | | * ToxicFrog eyes episode 9 |
02:50 | < ToxicFrog> | Oh, it is on |
02:50 | <@Vornicus> | This scene, it reminds me of Tomo-san from Azumanga. |
02:50 | < ToxicFrog> | ...and Ken-sensei ups the ante! |
02:50 | < ToxicFrog> | A BREAD DUEL! |
02:51 | < ToxicFrog> | Two bakers, locked in a battle to the death! |
02:51 | < ToxicFrog> | Or at least to the tasting! |
02:51 | <@Vornicus> | It doesn't count if Chairman Kaga isn't there. |
02:51 | < ToxicFrog> | Never read Azumanga. |
02:51 | <@Vornicus> | Tomo is, uh, in her own words, a "berzerker asshole" |
02:52 | <@Vornicus> | she makes many things more, uh, speedliney, than they ought to be. |
02:52 | < ToxicFrog> | Aah. |
02:52 | | * Vornicus likes that description. |
03:02 | < ToxicFrog> | I did not expect bread to be this interesting, but it has completely preempted Berserk and Death Note. |
03:04 | | * Vornicus finishes chapter 1. |
03:05 | | * Vornicus determines that it shall be read. |
03:12 | | * Vornicus also determines that he /really/ has to find more storage space. |
03:15 | < ToxicFrog> | Heh. Likewise. |
03:16 | < ToxicFrog> | Only six more drives to go before I can expand the array... |
03:16 | <@Vornicus> | pfffaaahaahaa |
03:16 | < ToxicFrog> | Filesystem Size Used Avail Use% Mounted on |
03:16 | < ToxicFrog> | /dev/md0 198G 82G 106G 44% / |
03:16 | < ToxicFrog> | /dev/mapper/orias-static |
03:16 | < ToxicFrog> | 330G 328G 2.2G 100% /static |
03:17 | < ToxicFrog> | http://www.onemanga.com/Yakitate!!_Japan/14/10/ |
03:18 | < ToxicFrog> | I suspect Azuma has SRMD. |
03:32 | < ToxicFrog> | Awesome! |
03:32 | < ToxicFrog> | Some of the omake are bread recipes. |
03:34 | <@Vornicus> | awesome |
04:45 | | RBot [~Reiver@Nightstar-3527.ubs-dsl.xnet.co.nz] has joined #Code |
04:46 | | ReivClass [~reaverta@IRCop.Nightstar.Net] has quit [Ping Timeout] |
04:46 | | DiceBot [~Reiver@Nightstar-28711.ubs-dsl.xnet.co.nz] has quit [Ping Timeout] |
04:47 | | RBot is now known as DiceBot |
04:53 | | ReivClass [~reaverta@IRCop.Nightstar.Net] has joined #Code |
05:05 | | ChalcyCleaning is now known as ChalcyAFK |
05:54 | | Vornicus is now known as Vornicus-Latens |
05:56 | | ChalcyAFK [~Chalcedon@Nightstar-10622.ue.woosh.co.nz] has quit [Connection reset by peer] |
05:56 | | Chalcedon [~Chalcedon@Nightstar-10622.ue.woosh.co.nz] has joined #code |
05:57 | | mode/#code [+o Chalcedon] by ChanServ |
06:55 | | Forj [~Forj@Nightstar-10622.ue.woosh.co.nz] has joined #code |
07:14 | | You're now known as TheWatcher |
09:08 | | gnolam [lenin@Nightstar-13557.8.5.253.se.wasadata.net] has joined #Code |
09:08 | | mode/#code [+o gnolam] by ChanServ |
10:09 | | AnnoDomini [~farkoff@Nightstar-29556.neoplus.adsl.tpnet.pl] has quit [Ping Timeout] |
10:15 | | AnnoDomini [~farkoff@Nightstar-29655.neoplus.adsl.tpnet.pl] has joined #Code |
10:33 | | Chalcy [~Chalcedon@Nightstar-10622.ue.woosh.co.nz] has joined #code |
10:34 | | Chalcedon [~Chalcedon@Nightstar-10622.ue.woosh.co.nz] has quit [Killed (NickServ (GHOST command used by Chalcy))] |
10:34 | | Chalcy is now known as Chalcedon |
10:35 | | Chalcedon is now known as NSGuest-240 |
10:36 | | NSGuest-240 is now known as Chalcedon |
10:42 | | Forj [~Forj@Nightstar-10622.ue.woosh.co.nz] has quit [Quit: Gone] |
10:51 | | gnolam [lenin@Nightstar-13557.8.5.253.se.wasadata.net] has quit [Quit: Ach, zombies!] |
10:58 | | Chalcedon [~Chalcedon@Nightstar-10622.ue.woosh.co.nz] has quit [Quit: Gone] |
11:34 | | ReivClass [~reaverta@IRCop.Nightstar.Net] has quit [Quit: Changing servers] |
11:34 | | ReivClass [~reaverta@IRCop.Nightstar.Net] has joined #Code |
11:35 | | ReivClass is now known as Reiver |
12:30 | | gnolam [lenin@Nightstar-13557.8.5.253.se.wasadata.net] has joined #Code |
12:30 | | mode/#code [+o gnolam] by ChanServ |
13:14 | | KarmaBot [~fark.off@87.72.35.ns-3885] has joined #Code |
13:14 | | mode/#code [+v KarmaBot] by ChanServ |
13:28 | | ToxicFrog [~ToxicFrog@Admin.Nightstar.Net] has quit [Operation timed out] |
13:38 | | gnolam [lenin@Nightstar-13557.8.5.253.se.wasadata.net] has quit [Quit: FZZZZZZZZZzzzzt+++NOCARRIER] |
13:44 | | gnolam [lenin@Nightstar-13557.8.5.253.se.wasadata.net] has joined #Code |
13:44 | | mode/#code [+o gnolam] by ChanServ |
13:45 | | gnolam [lenin@Nightstar-13557.8.5.253.se.wasadata.net] has quit [Quit: Dammit, it hadn't passed.] |
13:54 | | gnolam [lenin@Nightstar-13557.8.5.253.se.wasadata.net] has joined #Code |
13:54 | | mode/#code [+o gnolam] by ChanServ |
14:15 | | ToxicFrog [~ToxicFrog@Admin.Nightstar.Net] has joined #code |
14:15 | | mode/#code [+o ToxicFrog] by ChanServ |
15:15 | | Reiver is now known as ReivZzz |
15:42 | | mode/#code [+o TheWatcher] by ChanServ |
16:13 | | AnnoDomini [~farkoff@Nightstar-29655.neoplus.adsl.tpnet.pl] has quit [Ping Timeout] |
16:21 | | AnnoDomini [~farkoff@Nightstar-29503.neoplus.adsl.tpnet.pl] has joined #Code |
16:28 | | mode/#code [+o AnnoDomini] by ChanServ |
16:37 | | AnnoDomini [~farkoff@Nightstar-29503.neoplus.adsl.tpnet.pl] has quit [Ping Timeout] |
16:44 | | AnnoDomini [~farkoff@Nightstar-29136.neoplus.adsl.tpnet.pl] has joined #Code |
16:45 | | NSGuest-47 is now known as jerith |
16:51 | | mode/#code [+o AnnoDomini] by ChanServ |
17:10 | | You're now known as TheWatcher[afk] |
18:20 | | You're now known as TheWatcher |
19:30 | | AnnoDomini is now known as Lerhir |
20:19 | < jerith> | 21:26 < iddqd> "If Java had true garbage collection, most programs would delete themselves upon execution." |
20:20 | <@Lerhir> | Pfft. |
20:20 | | * jerith reinflates Lerhir. |
20:21 | | * Lerhir has so far made a decent DnD creature sheet generator in Java. In the process, he went from RealJ to DOS shell and Notepad. |
20:23 | <@ToxicFrog> | |
20:24 | <@Lerhir> | What? |
20:25 | <@ToxicFrog> | Notepad? |
20:26 | < jerith> | Dude, gvim. Or emacs. |
20:26 | <@Lerhir> | Well, it's called Notepad2. Major difference from Notepad is that it has search/replace. |
20:29 | | * Lerhir is suspicious of emacs, especially what with the image in the topic. vim-likes earned his dislike this semester, when he had to use them in OpSys class. |
20:31 | <@gnolam> | But still... Notepad? |
20:32 | <@gnolam> | At least use SciTE or something. |
20:32 | <@Lerhir> | Naw, Notepad is adequate. |
20:34 | < jerith> | Syntax highlighting ftw. |
20:35 | <@Lerhir> | Oh, yeah, this clone has that. |
20:35 | <@ToxicFrog> | Indentation? |
20:35 | <@ToxicFrog> | Scriptability? |
20:36 | <@Lerhir> | Indentation is present. Dunno what scriptability is. |
20:36 | <@ToxicFrog> | the ability to extend the editor beyond its default capabilities by writing new commands in an internal scripting language or by calling out to the shell. |
20:36 | <@ToxicFrog> | Anyways, it sounds like notepad2 has little in common with notepad. |
20:36 | <@Lerhir> | No, I'm pretty sure it doesn't have that. |
20:37 | <@Lerhir> | I usually compare text editors mainly by how fast they boot up. ;) |
20:38 | <@gnolam> | SciTE is pretty darn near instantaneous. |
20:38 | < jerith> | ... |
20:38 | <@Lerhir> | So's this clone. |
20:38 | <@Lerhir> | Notepad is instantaneous. I rate all other things by it. :P |
20:40 | <@ToxicFrog> | I can't think of any editors that aren't. |
20:41 | | * ToxicFrog sets about building a linux-host/win32-target cross-compilation environment |
20:41 | <@Lerhir> | EditPad Lite takes its time. |
20:43 | < jerith> | Emacs takes a few seconds. |
20:43 | <@gnolam> | And your sanity. |
20:43 | <@Lerhir> | Its learning curve is complex. |
20:43 | <@gnolam> | Non-Euclidean. |
21:01 | < EvilDarkLord> | Any SQLy people about? |
21:01 | <@Lerhir> | Before I parsed your name, I wanted to refer you to EvilDarkLord. |
21:02 | | ReivZzz is now known as ReivOut |
21:04 | < EvilDarkLord> | Okay, in case someone gets around to answering. I have three tables, call them a, b, and c. There are many rows in b and c for each row in a. I want to join the tables and get the results corresponding to the first 100 rows of a. |
21:05 | < jerith> | Subselect? |
21:06 | < EvilDarkLord> | Also, a way to keep rowcount at a*max(b, c) instead of a*b*c. |
21:06 | < jerith> | select * from (select * from a limit 100) join b join c; |
21:07 | < EvilDarkLord> | Ah, yes. Thanks! |
21:07 | < jerith> | The latter I have no idea about. My SQL-fu is weak. |
21:48 | | * gnolam ponders Erlang. |
21:49 | < jerith> | 'Tis well worth the ponderage. |
21:50 | <@gnolam> | Thing is, I can't think of a worthwhile project to help me learn it. |
21:55 | <@gnolam> | That's what the pondering is about. |
22:41 | | You're now known as TheWatcher[T-2] |
22:44 | | Chalcedon [~Chalcedon@Nightstar-10622.ue.woosh.co.nz] has joined #code |
22:44 | | mode/#code [+o Chalcedon] by ChanServ |
22:45 | | Forj [~Forj@Nightstar-10622.ue.woosh.co.nz] has joined #code |
22:46 | | You're now known as TheWatcher[zZzZ] |
23:36 | | Syloqs-AFH [Syloq@Admin.Nightstar.Net] has quit [Ping Timeout] |
23:37 | | Syloq [Syloq@Admin.Nightstar.Net] has joined #code |
23:39 | | Syloq is now known as Syloqs-AFH |
23:43 | | Vornicus-Latens is now known as Vornicus |
--- Log closed Sat Jul 28 00:00:26 2007 |