Public Class Public Static Int : Scanner - Inner - Buffer 1024 Delimiter

The document presents LargeStringScanner, a draft class written to work around a limitation the author describes in java.util.Scanner: a fixed internal buffer of 1024 characters, which prevents strings longer than that from being scanned correctly. LargeStringScanner splits a long string into parts of at most about 1024 characters, ending each part at the lexeme boundary nearest the limit. It then scans each part separately with Scanner and concatenates the results. In the example, every lexeme found is replaced with its upper-case form enclosed in "<>".
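The replacement step described above can be tried on a short string before dealing with the 1024-character limit at all. The following is only a minimal sketch: the class name LexemeDemo and its two helper methods are hypothetical names chosen for illustration, while the "\\b" delimiter and the "<UPPERCASE>" transformation are taken from the document's code.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Scanner;

public class LexemeDemo {

    // Splits the input into lexemes using the zero-width word-boundary delimiter "\\b".
    public static List<String> lexemes(String text) {
        List<String> result = new ArrayList<>();
        try (Scanner scanner = new Scanner(text).useDelimiter("\\b")) {
            while (scanner.hasNext()) {
                result.add(scanner.next());
            }
        }
        return result;
    }

    // Wraps every lexeme as "<UPPERCASE>" and joins the results in order.
    public static String enhance(String text) {
        StringBuilder out = new StringBuilder();
        for (String lexeme : lexemes(text)) {
            out.append('<').append(lexeme.toUpperCase()).append('>');
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(lexemes("hello world"));
        System.out.println(enhance("hello world"));
    }
}
```

Because "\\b" is a zero-width delimiter, the run of whitespace between two words is returned as a lexeme of its own, so it is wrapped in "<>" as well. A delimiter such as "\\s+" would be one way to skip whitespace instead, if that is the behaviour wanted.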

Uploaded by

DanijelBara

import java.util.Scanner;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/**
 * java.util.Scanner is a great tool for parsing, but it has some
 * disadvantages. One of them is an unchangeable buffer of length 1024.
 * It means that working with strings longer than 1024 characters will not
 * be correct: in fact, only the first 1024 symbols will be scanned. Also,
 * the java.util.Scanner class is final, so overriding its methods is not
 * possible.
 *
 * Here is a draft implementation of a solution in which the whole text is
 * split into N parts, each ending at the lexeme boundary nearest to the
 * n*1024th symbol, and each part is scanned separately. In this example
 * every lexeme found is replaced with its upper-case form enclosed
 * in "<>".
 */
public class LargeStringScanner {

    // Inner buffer size of java.util.Scanner
    public static int SCANNER_INNER_BUFFER = 1024;

    // Lexeme delimiter regex
    public static String DELIMITER = "\\b";

    public String enhance(String body) {
        // Nothing to scan: return the input unchanged
        if (body == null || body.isEmpty()) {
            return body;
        }
        StringBuilder enhancedBody = new StringBuilder();
        int bodyPartStartPosition = 0;

        while (bodyPartStartPosition < body.length()) {
            // Take at most one buffer's worth of text, then move the end
            // back to the last delimiter so that no lexeme is cut in half
            int bodyPartEndPosition = bodyPartStartPosition + SCANNER_INNER_BUFFER;
            bodyPartEndPosition = body.length() > bodyPartEndPosition
                    ? bodyPartEndPosition - 1
                    : body.length();
            bodyPartEndPosition = findLastDelimiterInSubString(
                    bodyPartStartPosition, bodyPartEndPosition, body);
            String subBody = body.substring(bodyPartStartPosition, bodyPartEndPosition);
            Scanner scanner = new Scanner(subBody).useDelimiter(Pattern.compile(DELIMITER));
            StringBuilder enhancedBodyPart = new StringBuilder();
            int charsRead = 0;
            while (scanner.hasNext()) {
                String word = scanner.next();
                // Copy the text between the previous lexeme and this one unchanged
                enhancedBodyPart.append(subBody, charsRead, scanner.match().start());
                enhancedBodyPart.append(doSmthWithLexeme(word));
                charsRead = scanner.match().end();
            }
            // Copy whatever is left after the last lexeme
            enhancedBodyPart.append(subBody.substring(charsRead));
            enhancedBody.append(enhancedBodyPart);
            bodyPartStartPosition = bodyPartEndPosition;
        }
        return enhancedBody.toString();
    }

    private int findLastDelimiterInSubString(int startPosition, int endPosition,
                                             String largeString) {
        // The last part already ends at the end of the text
        if (endPosition >= largeString.length()) {
            return endPosition;
        }
        // DELIMITER may be zero-width (as "\\b" is), so it can never match a
        // whole single character. Test it at each position of the full string
        // instead, letting the matcher see the characters on both sides.
        Matcher matcher = Pattern.compile(DELIMITER).matcher(largeString);
        matcher.useTransparentBounds(true);
        matcher.useAnchoringBounds(false);
        for (int i = endPosition - 1; i > startPosition; i--) {
            matcher.region(i, largeString.length());
            if (matcher.lookingAt()) {
                endPosition = i;
                break;
            }
        }
        return endPosition;
    }

    protected String doSmthWithLexeme(String lexeme) {
        return "<" + lexeme.toUpperCase() + ">";
    }
}
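The splitting step on its own, cutting the text into parts no longer than a given limit and ending each part on a word boundary, can be sketched separately from the scanning. This is an illustrative sketch only, not the author's code: the class ChunkSplitDemo, the method split, and the parameter maxLen are hypothetical names, and the sketch assumes the same "\\b" delimiter as the class above. If no boundary exists inside the limit (a single word longer than maxLen), it falls back to a hard cut.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class ChunkSplitDemo {

    private static final Pattern WORD_BOUNDARY = Pattern.compile("\\b");

    // Splits text into consecutive chunks of at most maxLen characters,
    // ending each chunk at the last word boundary inside the limit if there is one.
    public static List<String> split(String text, int maxLen) {
        if (maxLen < 1) {
            throw new IllegalArgumentException("maxLen must be positive");
        }
        List<String> chunks = new ArrayList<>();
        Matcher matcher = WORD_BOUNDARY.matcher(text);
        // Let \b look at the characters on both sides of the tested position
        matcher.useTransparentBounds(true);
        matcher.useAnchoringBounds(false);

        int start = 0;
        while (start < text.length()) {
            int end = Math.min(start + maxLen, text.length());
            if (end < text.length()) {
                // Walk backwards from the limit to the nearest word boundary
                for (int i = end; i > start; i--) {
                    matcher.region(i, text.length());
                    if (matcher.lookingAt()) {
                        end = i;
                        break;
                    }
                }
            }
            chunks.add(text.substring(start, end));
            start = end;
        }
        return chunks;
    }

    public static void main(String[] args) {
        System.out.println(split("alpha beta gamma", 8));
    }
}
```

Concatenating the chunks in order always gives back the original text, so each chunk can be processed independently and the results appended one after another.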
